Description

The present invention relates to a stable enzyme composition comprising a thermostable nucleic acid polymerase
enzyme. In one embodiment the enzyme in said composition is DNA polymerase purified from Thermus aquaticus and
has a molecular weight of about 86,000-90,000 daltons.

The present invention relates to a process for amplifying existing nucleic acid sequences if they are present in a
test sample and detecting them if present by using a probe. More specifically, it relates to a process for producing any
particular nucleic acid sequence from a given sequence of DNA or RNA in amounts which are large compared to the
amount initially present so as to facilitate detection of the sequences, using a thermostable enzyme to catalyze the
reaction. The DNA or RNA may be single- or double-stranded, and may be a relatively pure species or a component
of a mixture of nucleic acids. The process of the invention utilizes a repetitive reaction to accomplish the amplification
of the desired nucleic acid sequence.

Extensive research has been conducted on the isolation of DNA polymerases from mesophilic microorganisms
such as E. coli. See, for example, Bessman et al., J. Biol. Chem. (1957) 233:171-177 and Buttin and Kornberg (1966)
J. Biol. Chem. 241:5419-5427.

In contrast, relatively little investigation has been made on the isolation and purification of DNA polymerases from
thermophiles, such as Thermus aquaticus. Kaledin et al., Biokhimiya (1980) 45:644-651 discloses a six-step isolation
and purification procedure of DNA polymerase from cells of T. aquaticus YT1 strain. These steps involve isolation of
crude extract, DEAE-cellulose chromatography, fractionation on hydroxyapatite, fractionation on DEAE-cellulose, and
chromatography on single-strand DNA-cellulose. The pools from each stage were not screened for contaminating
endo- and exonuclease(s). The molecular weight of the purified enzyme is reported as 62,000 daltons per monomeric
unit.

A second purification scheme for a polymerase from T. aquaticus is described by A. Chien et al., J. Bacteriol.
(1976) 127:1550-1557. In this process, the crude extract is applied to a DEAE-Sephadex column. The dialyzed pooled
fractions are then subjected to treatment on a phosphocellulose column. The pooled fractions are dialyzed and bovine
serum albumin (BSA) is added to prevent loss of polymerase activity. The resulting mixture is loaded on a DNA-cellulose
column. The pooled material from the column is dialyzed and analyzed by gel filtration to have a molecular weight of
about 63,000 daltons, and, by sucrose gradient centrifugation, of about 68,000 daltons.

The use of a thermostable enzyme to amplify existing nucleic acid sequences in amounts that are large compared
to the amount initially present has been suggested in European Pat. Pub. No. 200,362, published December 10, 1986.
Primers, nucleotide triphosphates, and a polymerase are used in the process, which involves denaturation, synthesis
of template strands and hybridization. The extension product of each primer becomes a template for the production of
the desired nucleic acid sequence. The application discloses that if the polymerase employed is a thermostable enzyme,
it need not be added after every denaturation step, because the heat will not destroy its activity. No other advantages
or details are provided on the use of a purified thermostable DNA polymerase. The amplification and detection process
is also described by Saiki et al., Science, 230:1350-1354 (1985), and by Saiki et al., Bio/Technology, 3:1008-1012
(1985).

Accordingly, there is a desire in the art to produce a purified, stable thermostable enzyme that may be used to
improve the diagnostic amplification process described above.

Accordingly, the present invention provides a purified thermostable enzyme that catalyzes combination of
nucleotide triphosphates to form a nucleic acid strand complementary to a nucleic acid template strand. Preferably the
purified enzyme is DNA polymerase from Thermus aquaticus and has a molecular weight of about 86,000-90,000
daltons. This purified material may be used in a temperature-cycling amplification reaction wherein nucleic acid
sequences are produced from a given nucleic acid sequence in amounts that are large compared to the amount initially
present so that they can be detected easily.

The gene encoding the DNA polymerase from Thermus aquaticus has also been identified and
provides yet another means to retrieve the thermostable enzyme of the present invention. In addition to the gene
encoding the approximately 86,000-90,000 dalton enzyme, gene derivatives encoding DNA polymerase activity are
also presented. A gene encoding an enzyme having a molecular weight of about 60,000 to 65,000 daltons cloned from
the genome of Thermus aquaticus is also provided.

The invention also encompasses a stable enzyme composition comprising a purified thermostable enzyme as
described above in a buffer containing one or more non-ionic polymeric detergents.

The purified enzyme, as well as the enzymes produced by recombinant DNA techniques, provide much more
specificity than the Klenow fragment, which is not thermostable. In addition, the purified enzyme and the recombinantly
produced enzymes exhibit the appropriate activity expected when dTTP or other nucleotide triphosphates are not
present in the incubation mixture with the DNA template. Also, the enzymes herein have a broader pH profile than that
of the thermostable enzyme from Thermus aquaticus described in the literature, with more than 50% of the activity at
pH 7 as at pH 8. In addition, the thermostable enzyme herein can be stored in a buffer with non-ionic detergents so
that it is stable, not losing activity over a period of time.

The present invention resides in a process for amplifying one or more specific nucleic acid sequences present in
a nucleic acid or mixture thereof using primers and a thermostable enzyme. The extension product of one primer when
hybridized to the other becomes a template for the production of the desired specific nucleic acid sequence, and vice
versa, and the process is repeated as often as is necessary to produce the desired amount of the sequence. The
method herein improves the specificity of the amplification reaction, resulting in a very distinct signal of amplified nucleic
acid. In addition, the method herein eliminates the need for transferring reagents from one vessel to another after each
amplification cycle. Such transferring is not required because the thermostable enzyme will withstand the high
temperatures required to denature the nucleic acid strands and therefore does not need replacement. The temperature
cycling may, in addition, be automated for further reduction in manpower and steps required to effectuate the
amplification reaction.
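Because each extension product can serve as a template in the next round, the amount of the target sequence can at best double per cycle. The following Python sketch illustrates that idealized growth. The function name and the `efficiency` parameter are illustrative assumptions and do not appear in the text; real reactions fall short of perfect doubling.

```python
def ideal_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized number of target copies after repeated amplification cycles.

    efficiency is an assumed fraction of templates copied in each cycle
    (1.0 means perfect doubling); it is not a value taken from the text.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    # Each cycle multiplies the copy number by (1 + efficiency).
    return initial_copies * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    for n in (10, 20, 30):
        print(n, "cycles:", ideal_copies(1, n), "copies per starting molecule")
```

With perfect doubling, ten cycles give roughly a thousandfold increase, which is why a modest number of repetitions can make a rare sequence detectable.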

More specifically, the present invention provides a process for amplifying at least one specific nucleic acid sequence
contained in a nucleic acid or a mixture of nucleic acids, wherein if the nucleic acid is double-stranded, it consists of
two separated complementary strands of equal or unequal length, which process comprises:

(a) contacting each nucleic acid strand with four different nucleotide triphosphates and one oligonucleotide primer
for each different specific sequence being amplified, wherein each primer is selected to be substantially
complementary to different strands of each specific sequence, such that the extension product synthesized from one
primer, when it is separated from its complement, can serve as a template for synthesis of the extension product
of the other primer, said contacting being at a temperature which promotes hybridization of each primer to its
complementary nucleic acid strand;

(b) contacting each nucleic acid strand, at the same time as or after step (a), with a thermostable enzyme which
enables combination of the nucleotide triphosphates to form primer extension products complementary to each
strand of each nucleic acid;

(c) maintaining the mixture from step (b) at an effective temperature for an effective time to activate the enzyme,
and to synthesize, for each different sequence being amplified, an extension product of each primer which is
complementary to each nucleic acid strand template, but not so high (a temperature) as to separate each extension
product from its complementary strand template;

(d) heating the mixture from step (c) for an effective time and at an effective temperature to separate the primer 
extension products from the templates on which they were synthesized to produce single-stranded molecules, but 
not so high (a temperature) as to denature irreversibly the enzyme; 

(e) cooling the mixture from step (d) at an effective temperature for an effective time to promote hybridization of 
each primer to each of the single-stranded molecules produced in step (d); and 

(f) maintaining the mixture from step (e) at an effective temperature for an effective time to promote the activity of
the enzyme and to synthesize, for each different sequence being amplified, an extension product of each primer
which is complementary to each nucleic acid strand template produced in step (d), but not so high (a temperature)
as to separate each extension product from its complementary strand template, wherein steps (e) and (f) are
carried out simultaneously or sequentially.

The steps (d), (e) and (f) may be repeated until the desired level of sequence amplification is obtained. The preferred
thermostable enzyme herein is a polymerase extracted from Thermus aquaticus (Taq polymerase). Most preferably, if
the enzyme is Taq polymerase, in step (a) the nucleic acid strands are contacted with a buffer comprising about 1.5-2
mM of a magnesium salt, 150-200 µM each of the nucleotides, and 1 µM of each primer, steps (a), (e) and (f) are
carried out at about 45-58°C, and step (d) is carried out at about 90-100°C.
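The repetition of steps (d), (e) and (f) can be pictured as a simple temperature program. The sketch below is a hypothetical illustration only: the `Step` record, the `build_program` function, the default temperatures and all hold times are assumptions chosen for the example, with the temperatures checked against the preferred Taq ranges quoted above. Step (c) is assumed, for simplicity, to run at the same temperature as step (a).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    temp_c: float
    seconds: int


def build_program(cycles, anneal_extend_c=55.0, denature_c=95.0,
                  anneal_extend_s=120, denature_s=60):
    """Return the ordered temperature holds for one amplification run.

    Temperatures are validated against the preferred ranges for Taq
    polymerase (about 45-58 C for steps (a), (e), (f); about 90-100 C
    for step (d)). Hold times are illustrative assumptions.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 45.0 <= anneal_extend_c <= 58.0:
        raise ValueError("hybridization/extension temperature outside 45-58 C")
    if not 90.0 <= denature_c <= 100.0:
        raise ValueError("denaturation temperature outside 90-100 C")

    # Initial hybridization and extension, steps (a) to (c).
    program = [Step("(a)-(c) hybridize and extend", anneal_extend_c, anneal_extend_s)]
    # Each repetition: separate the strands, then hybridize and extend again.
    for _ in range(cycles):
        program.append(Step("(d) denature", denature_c, denature_s))
        program.append(Step("(e)-(f) hybridize and extend", anneal_extend_c, anneal_extend_s))
    return program


if __name__ == "__main__":
    for step in build_program(2):
        print(f"{step.label:32s} {step.temp_c:5.1f} C  {step.seconds:4d} s")
```

Because the enzyme survives step (d), the whole list can be run in a single vessel without adding fresh enzyme between repetitions.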

In a preferred embodiment the nucleic acid(s) are double-stranded and step (a) is accomplished by (i) heating
each nucleic acid in the presence of four different nucleotide triphosphates and one oligonucleotide primer for each
different specific sequence being amplified, for an effective time and at an effective temperature to denature each
nucleic acid, wherein each primer is selected to be substantially complementary to different strands of each specific
sequence such that the extension product synthesized from one primer, when it is separated from its complement,
can serve as a template for synthesis of the extension product of the other primer; and (ii) cooling the denatured nucleic
acids to a temperature which promotes hybridization of each primer to its complementary nucleic acid strand.

In other embodiments the invention relates to a process for detecting the presence or absence of at least one
specific nucleic acid sequence in a sample containing a nucleic acid or mixture of nucleic acids, or distinguishing
between two different sequences in said sample, wherein the sample is suspected of containing said sequence or
sequences, and wherein if the nucleic acid(s) are double-stranded, they each consist of two separated complementary
strands of equal or unequal length, which process comprises steps (a) to (f) mentioned above, resulting in amplification
in quantity of the specific nucleic acid sequence(s), if present;

(g) adding to the product of step (f) a labeled oligonucleotide probe, for each sequence being detected, capable
of hybridizing to said sequence or to a mutation thereof; and

(h) determining whether said hybridization has occurred.

In yet another embodiment, the invention relates to a process for detecting the presence or absence of at least
one nucleotide variation in sequence in one or more nucleic acids contained in a sample, wherein if the nucleic acid
is double-stranded it consists of two separated complementary strands of equal or unequal length, which process
comprises steps (a)-(f) mentioned above, wherein steps (d), (e) and (f) are repeated a sufficient number of times to
result in detectable amplification of the nucleic acid containing the sequence, if present;

(g) affixing the product of step (f) to a membrane; 

(h) treating the membrane under hybridization conditions with a labeled sequence-specific oligonucleotide probe
capable of hybridizing with the amplified nucleic acid sequence only if a sequence of the probe is complementary
to a region of the amplified sequence; and

(i) detecting whether the probe has hybridized to an amplified sequence in the nucleic acid sample.

If the sample comprises cells, preferably they are heated before step (a) to expose the nucleic acids therein to the
reagents. This step avoids extraction of the nucleic acids prior to reagent addition.

In a variation of this process, the primer(s) and/or nucleotide triphosphates are labeled so that the resulting
amplified sequence is labeled. The labeled primer(s) and/or nucleotide triphosphate(s) can be present in the reaction
mixture initially or added during a later cycle. The sequence-specific oligonucleotide (unlabeled) is affixed to a
membrane and treated under hybridization conditions with the labeled amplification product so that hybridization will occur
only if the membrane-bound sequence is present in the amplification product.

In yet another embodiment, the invention herein relates to a process for cloning into a cloning vector one or more
specific nucleic acid sequences contained in a nucleic acid or a mixture of nucleic acids, which nucleic acid(s) when
double-stranded consist of two separated complementary strands, and which nucleic acid(s) are amplified in quantity
before cloning, which process comprises steps (a)-(f) mentioned above, with steps (d), (e) and (f) being repeated a
sufficient number of times to result in detectable amplification of the nucleic acid(s) containing the sequence(s),

(g) adding to the product of step (f) a restriction enzyme for each of said restriction sites to obtain cleaved products
in a restriction digest; and

(h) ligating the cleaved product(s) of step (g) containing the specific sequence(s) to be cloned into one or more
cloning vectors containing a promoter and a selectable marker.

In a further embodiment, the invention herein relates to a process for cloning into a cloning vector one or more
specific nucleic acid sequences contained in a nucleic acid or mixture of nucleic acids, which nucleic acid(s), when
double-stranded, consist of two separated complementary strands of equal or unequal length, which nucleic acid(s)
are amplified in quantity before cloning, which process comprises steps (a)-(f) mentioned above, with steps (d), (e)
and (f) being repeated a sufficient number of times to result in effective amplification of the nucleic acid(s) containing
the sequence(s) for blunt-end ligation into one or more cloning vectors; and

(g) ligating the amplified specific sequence(s) to be cloned obtained from step (f) into one or more of said cloning
vectors in the presence of a ligase, said amplified sequence(s) and vector(s) being present in sufficient amounts to
effect the ligation.

In a product embodiment, the invention provides a composition of matter useful in amplifying at least one specific
nucleic acid sequence contained in a nucleic acid or a mixture of nucleic acids, comprising four different nucleotide
triphosphates and one oligonucleotide primer for each different specific sequence being amplified, wherein each primer
is selected to be substantially complementary to different strands of each specific sequence, such that the extension
product synthesized from one primer, when it is separated from its complement, can serve as a template for synthesis
of the extension product of the other primer.

In another product embodiment the invention provides a sample of one or more nucleic acids comprising multiple
strands of a specific nucleic acid sequence contained in the nucleic acid(s). The sample may comprise about 10-100
of the strands, about 100-1000 of the strands, or over about 1000 of the strands.

In a further product embodiment, the invention provides an amplified nucleic acid sequence from a nucleic acid or
mixture of nucleic acids comprising multiple copies of the sequence produced by the amplification processes herein.

Figure 1 is a restriction site map of plasmid pFC83 that contains the ~4.5 kb HindIII T. aquaticus DNA insert
subcloned into plasmid BSM13+.

Figure 2 is a restriction site map of plasmid pFC85 that contains the ~2.8 kb HindIII to Asp718 T. aquaticus DNA
insert subcloned into plasmid BSM13+.

As used herein, "cell", "cell line", and "cell culture" can be used interchangeably and all such designations include
progeny. Thus, the words "transformants" or "transformed cells" include the primary subject cell and cultures derived
therefrom without regard for the number of transfers. It is also understood that all progeny may not be precisely identical
in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the same functionality as
screened for in the originally transformed cell are included.

The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked coding
sequence in a particular host organism. The control sequences that are suitable for procaryotes, for example, include
a promoter, optionally an operator sequence, a ribosome binding site, and possibly other as yet poorly understood
sequences. Eucaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers.

The term "expression system" refers to DNA sequences containing a desired coding sequence and control
sequences in operable linkage, so that hosts transformed with these sequences are capable of producing the encoded
proteins. In order to effect transformation, the expression system may be included on a vector; however, the relevant
DNA may then also be integrated into the host chromosome.

The term "gene" as used herein refers to a DNA sequence that encodes a recoverable bioactive polypeptide or
precursor. The polypeptide can be encoded by a full-length gene sequence or any portion of the coding sequence so
long as the enzymatic activity is retained.

"Operably linked" refers to juxtaposition such that the normal function of the components can be performed. Thus,
a coding sequence "operably linked" to control sequences refers to a configuration wherein the coding sequences can
be expressed under the control of the sequences.

"Non-ionic polymeric detergents" refers to surface-active agents that have no ionic charge and that are
characterized, for purposes of this invention, by their ability to stabilize the enzyme herein at a pH range of from about 3.5 to
about 9.5, preferably from 4 to 8.5.

The term "oligonucleotide" as used herein is defined as a molecule comprised of two or more deoxyribonucleotides
or ribonucleotides, preferably more than three. Its exact size will depend on many factors, which in turn depend on the
ultimate function or use of the oligonucleotide. The oligonucleotide may be derived synthetically or by cloning.

The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally as in a purified restriction
digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under
conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced,
i.e., in the presence of four different nucleotide triphosphates and thermostable enzyme in an appropriate buffer ("buffer"
includes pH, ionic strength, cofactors, etc.) and at a suitable temperature. For Taq polymerase the buffer herein
preferably contains 1.5-2 mM of a magnesium salt, preferably MgCl2, 150-200 µM of each nucleotide, and 1 µM of each
primer, along with preferably 50 mM KCl, 10 mM Tris buffer, pH 8-8.4, and 100 µg/ml gelatin.
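As a rough illustration of how the preferred buffer concentrations translate into pipetting volumes, the sketch below applies the usual dilution relation (final concentration x final volume = stock concentration x stock volume). The target values come from the paragraph above, taking the upper end of each stated range; the stock concentrations and the 100 µl reaction volume are purely hypothetical assumptions, as are the function and variable names.

```python
def stock_volume_ul(final_conc, stock_conc, final_volume_ul):
    """Volume of stock needed so the reagent reaches final_conc in final_volume_ul.

    final_conc and stock_conc must be given in the same unit.
    """
    if stock_conc <= 0:
        raise ValueError("stock concentration must be positive")
    if final_conc > stock_conc:
        raise ValueError("stock is more dilute than the target concentration")
    return final_conc * final_volume_ul / stock_conc


# Target concentrations taken from the text (upper end of each stated range).
TARGETS = {
    "MgCl2 (mM)": 2.0,
    "each nucleotide (uM)": 200.0,
    "each primer (uM)": 1.0,
    "KCl (mM)": 50.0,
    "Tris pH 8-8.4 (mM)": 10.0,
    "gelatin (ug/ml)": 100.0,
}

# Hypothetical stock concentrations, in the same units as the targets.
STOCKS = {
    "MgCl2 (mM)": 100.0,
    "each nucleotide (uM)": 10000.0,
    "each primer (uM)": 100.0,
    "KCl (mM)": 1000.0,
    "Tris pH 8-8.4 (mM)": 1000.0,
    "gelatin (ug/ml)": 10000.0,
}

if __name__ == "__main__":
    reaction_ul = 100.0  # assumed reaction volume
    for name, target in TARGETS.items():
        vol = stock_volume_ul(target, STOCKS[name], reaction_ul)
        print(f"{name:24s} {vol:6.2f} ul of stock")
```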

The primer is preferably single-stranded for maximum efficiency in amplification, but may alternatively be
double-stranded. If double-stranded, the primer is first treated to separate its strands before being used to prepare extension
products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime the
synthesis of extension products in the presence of the thermostable enzyme. The exact lengths of the primers will depend
on many factors, including temperature, source of primer and use of the method. For example, depending on the
complexity of the target sequence, the oligonucleotide primer typically contains 15-25 nucleotides, although it may
contain more or fewer nucleotides. Short primer molecules generally require cooler temperatures to form sufficiently
stable hybrid complexes with template.

The primers herein are selected to be "substantially" complementary to the different strands of each specific
sequence to be amplified. This means that the primers must be sufficiently complementary to hybridize with their
respective strands. Therefore, the primer sequence need not reflect the exact sequence of the template. For example, a
non-complementary nucleotide fragment may be attached to the 5' end of the primer, with the remainder of the primer
sequence being complementary to the strand. Alternatively, non-complementary bases or longer sequences can be
interspersed into the primer, provided that the primer sequence has sufficient complementarity with the sequence of
the strand to be amplified to hybridize therewith and thereby form a template for synthesis of the extension product of
the other primer. However, for detection purposes, particularly using labeled sequence-specific probes, the primers
typically have exact complementarity to obtain the best results.
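The distinction between exact and "substantial" complementarity can be made concrete with a small sequence comparison. The sketch below is illustrative only: the function names and the `max_mismatches` threshold are assumptions, and a simple mismatch count is a crude stand-in for real hybridization behaviour, which also depends on temperature, salt and primer length.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def mismatches(primer, template_site):
    """Count primer positions that do not pair with the template site.

    Both sequences are written 5' to 3' and must have equal length; the
    primer pairs with the template in antiparallel orientation.
    """
    if len(primer) != len(template_site):
        raise ValueError("primer and template site must have equal length")
    perfect = reverse_complement(template_site)
    return sum(1 for p, q in zip(primer.upper(), perfect) if p != q)


def is_substantially_complementary(primer, template_site, max_mismatches=2):
    """Assumed criterion: at most max_mismatches non-pairing positions."""
    return mismatches(primer, template_site) <= max_mismatches


if __name__ == "__main__":
    site = "AAGCTTGC"
    print(reverse_complement(site))        # exact primer for this site
    print(mismatches("GCAAGCTA", site))    # primer with one altered base
```

A primer carrying a non-complementary 5' tail could be checked the same way by comparing only its 3' portion against the template site.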

As used herein, the terms "restriction endonucleases" and "restriction enzymes" refer to bacterial enzymes each
of which cuts double-stranded DNA at or near a specific nucleotide sequence.

As used herein, the term "DNA polymorphism" refers to the condition in which two or more different nucleotide 
sequences can exist at a particular site in DNA. 

As used herein, the term "nucleotide variation in sequence" refers to any single or multiple nucleotide substitutions,
deletions or insertions. These nucleotide variations may be mutant or polymorphic allele variations. Therefore, the
process herein can detect single nucleotide changes in nucleic acids such as occur in β-globin genetic diseases caused
by single-base mutations, additions or deletions (some β-thalassemias, sickle cell anemia, hemoglobin C disease,
etc.), as well as multiple-base variations such as are involved with α-thalassemia or some β-thalassemias. In addition,
the process herein can detect polymorphisms, which are not necessarily associated with a disease, but are merely a
condition in which two or more different nucleotide sequences (whether having substituted, deleted or inserted
nucleotide base pairs) can exist at a particular site in the nucleic acid in the population, as with HLA regions of the human
genome and random polymorphisms such as mitochondrial DNA. The polymorphic sequence-specific oligonucleotide
probes described in detail hereinafter may be used to detect genetic markers linked to a disease such as
insulin-dependent diabetes mellitus or in forensic applications. If the nucleic acid is double-stranded, the nucleotide variation
in sequence becomes a base pair variation in sequence.

The term "sequence-specific oligonucleotides" refers to oligonucleotides which will hybridize to specific sequences
whether or not contained on alleles, which sequences span the base pair variation being detected and are specific for
the sequence variation being detected. Depending on the sequences being analyzed, one or more sequence-specific
oligonucleotides may be employed for each sequence, as described further hereinbelow.

As used herein, the term "restriction fragment length polymorphism" ("RFLP") refers to the differences among
individuals in the lengths of restriction fragments formed by digestion with a particular restriction endonuclease.
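To make the RFLP definition concrete, the sketch below digests two toy sequences that differ by a single base inside a recognition site and compares the resulting fragment lengths. The sequences and function name are invented for illustration; the recognition site AAGCTT, cut after the first A, is that of HindIII. Only the top strand of a linear molecule is modelled, which is a simplification.

```python
def digest_lengths(sequence, site, cut_offset):
    """Fragment lengths after cutting a linear sequence at every site.

    cut_offset is the number of bases from the start of the recognition
    site to the cut position on the strand shown.
    """
    seq = sequence.upper()
    site = site.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Toy alleles (hypothetical): allele_2 has lost the second site.
    allele_1 = "GGGG" + "AAGCTT" + "CCCCCC" + "AAGCTT" + "TT"
    allele_2 = "GGGG" + "AAGCTT" + "CCCCCC" + "AAGCTA" + "TT"
    print(digest_lengths(allele_1, "AAGCTT", 1))
    print(digest_lengths(allele_2, "AAGCTT", 1))
```

The single-base change removes one cut, so the two alleles yield different fragment patterns, which is the length difference the term refers to.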

As used herein, the term "thermostable enzyme" refers to an enzyme which is stable to heat and is heat resistant
and catalyzes (facilitates) combination of the nucleotides in the proper manner to form the primer extension products
that are complementary to each nucleic acid strand. Generally, the synthesis will be initiated at the 3' end of each
primer and will proceed in the 5' direction along the template strand, until synthesis terminates, producing molecules
of different lengths. There may be a thermostable enzyme, however, which initiates synthesis at the 5' end and proceeds
in the other direction, using the same process as described above.

The thermostable enzyme herein must satisfy a single criterion to be effective for the amplification reaction, i.e.,
the enzyme must not become irreversibly denatured (inactivated) when subjected to the elevated temperatures for the
time necessary to effect denaturation of double-stranded nucleic acids. Irreversible denaturation for purposes herein
refers to permanent and complete loss of enzymatic activity. The heating conditions necessary for denaturation will
depend, e.g., on the buffer salt concentration and the length and nucleotide composition of the nucleic acids being
denatured, but typically range from about 90 to about 105°C for a time depending mainly on the temperature and the
nucleic acid length, typically about 0.5 to four minutes. Higher temperatures may be tolerated as the buffer salt
concentration and/or GC composition of the nucleic acid is increased. Preferably, the enzyme will not become irreversibly
denatured at about 90-100°C.

The thermostable enzyme herein preferably has an optimum temperature at which it functions that is higher than
about 40°C, which is the temperature below which hybridization of primer to template is promoted, although, depending
on (1) magnesium and salt concentrations and (2) composition and length of primer, hybridization can occur at higher
temperature (e.g., 45-70°C). The higher the temperature optimum for the enzyme, the greater the specificity and/or
selectivity of the primer-directed extension process. However, enzymes that are active below 40°C, e.g., at 37°C, are
also within the scope of this invention provided they are heat-stable. Preferably, the optimum temperature ranges from
about 50 to 90°C, more preferably 60-80°C.

The thermostable enzyme herein may be obtained from any source and may be a native or recombinant protein.
Examples of enzymes that have been reported in the literature as being resistant to heat include heat-stable
polymerases, such as, e.g., polymerases extracted from the thermophilic bacteria Thermus flavus, Thermus ruber, Thermus
thermophilus, Bacillus stearothermophilus (which has a somewhat lower temperature optimum than the others listed),
Thermus aquaticus, Thermus lacteus, Thermus rubens, and Methanothermus fervidus.

The preferred thermostable enzyme herein is a DNA polymerase isolated from Thermus aquaticus. Various strains
thereof are available from the American Type Culture Collection, Rockville, Maryland, and are described by T.D. Brock,
J. Bact. (1969) 98:289-297, and by T. Oshima, Arch. Microbiol. (1976) 117:189-196. One of these preferred strains is
strain YT-1.

For recovering the native protein the cells are grown using any suitable technique. One such technique is described
by Kaledin et al., Biokhimiya (1980), supra. Briefly, the cells are grown on a medium, in one liter, of nitrilotriacetic acid
(100 mg), tryptone (3 g), yeast extract (3 g), succinic acid (5 g), sodium sulfite (50 mg), riboflavin (1 mg), K2HPO4 (522
mg), MgSO4 (480 mg), CaCl2 (222 mg), NaCl (20 mg), and trace elements. The pH of the medium is adjusted to 8.0 ±
0.2 with KOH. The yield is increased, up to 20 g/liter of cells, if the culture is grown with vigorous aeration at a temperature of
70°C. Cells in the late logarithmic stage (determined by absorbance at 550 nm) are collected by centrifugation, washed
with a buffer and stored frozen at -20°C.

In another method for growing the cells, described in Chien et al., J. Bacteriol. (1976), supra, a defined mineral
salts medium containing 0.3% glutamic acid supplemented with 0.1 mg/l biotin, 0.1 mg/l thiamine, and 0.05 mg/l nicotinic
acid is employed. The salts include nitrilotriacetic acid, CaSO4, MgSO4, NaCl, KNO3, NaNO3, ZnSO4, H3BO3, CuSO4,
NaMoO4, CoCl2, FeCl3, MnSO4, and Na2HPO4. The pH of the medium is adjusted to 8.0 with NaOH.

In the Chien et al. technique, the cells are grown initially at 75°C in a water bath shaker. On reaching a certain
density, 1 liter of these cells is transferred to 16-liter carboys which are placed in hot-air incubators. Sterile air is bubbled
through the cultures and the temperature maintained at 75°C. The cells are allowed to grow for 20 hours before being
collected by centrifugation.

After cell growth, the isolation and purification of the enzyme take place in six stages, each of which is carried out 
at a temperature below room temperature, preferably about 4°C. 

In the first stage or step, the cells, if frozen, are thawed, disintegrated by ultrasound, suspended in a buffer at
about pH 7.5, and centrifuged.

In the second stage, the supernatant is collected and then fractionated by adding a salt such as dry ammonium
sulfate. The appropriate fraction (typically 45-75% of saturation) is collected, dissolved in a 0.2 M potassium phosphate
buffer preferably at pH 6.5, and dialyzed against the same buffer.

The third step removes nucleic acids and some protein. The fraction from the second stage is applied to a
DEAE-cellulose column equilibrated with the same buffer as used above. Then the column is washed with the same buffer
and the flow-through protein-containing fractions, determined by absorbance at 280 nm, are collected and dialyzed
against a 10 mM potassium phosphate buffer, preferably with the same ingredients as the first buffer, but at a pH of 7.5.

In the fourth step, the fraction so collected is applied to a hydroxyapatite column equilibrated with the buffer used
for dialysis in the third step. The column is then washed and the enzyme eluted with a linear gradient of a buffer such
as 0.01 M to 0.5 M potassium phosphate buffer at pH 7.5 containing 10 mM 2-mercaptoethanol and 5% glycerine. The
pooled fractions containing thermostable enzyme (e.g., DNA polymerase) activity are dialyzed against the same buffer
used for dialysis in the third step.
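For a linear gradient such as the 0.01 M to 0.5 M potassium phosphate elution just described, the buffer concentration reaching the column rises in proportion to the volume delivered. The sketch below shows that relation; the function name and the total gradient volume used in the example are assumptions for illustration, since the text does not specify a gradient volume.

```python
def gradient_concentration(start_m, end_m, delivered_ml, total_ml):
    """Concentration (M) of a linear gradient after delivered_ml of total_ml."""
    if total_ml <= 0:
        raise ValueError("total gradient volume must be positive")
    if not 0 <= delivered_ml <= total_ml:
        raise ValueError("delivered volume must lie within the gradient")
    fraction = delivered_ml / total_ml
    return start_m + (end_m - start_m) * fraction


if __name__ == "__main__":
    total = 100.0  # hypothetical gradient volume in ml
    for delivered in (0.0, 25.0, 50.0, 100.0):
        conc = gradient_concentration(0.01, 0.5, delivered, total)
        print(f"{delivered:6.1f} ml -> {conc:.4f} M phosphate")
```

Recording the delivered volume at which activity elutes therefore gives an estimate of the concentration at which the enzyme is released from the column.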

In the fifth stage, the dialyzed fraction is applied to a DEAE-cellulose column, equilibrated with the buffer used for
dialysis in the third step. The column is then washed and the enzyme eluted with a linear gradient of a buffer such as
0.01 to 0.6 M KCl in the buffer used for dialysis in the third step. Fractions with thermostable enzyme activity are then
tested for contaminating deoxyribonucleases (endo- and exonucleases) using any suitable procedure. For example,
the endonuclease activity may be determined electrophoretically from the change in molecular weight of phage λ DNA
or supercoiled plasmid DNA after incubation with an excess of DNA polymerase. Similarly, exonuclease activity may
be determined electrophoretically from the change in molecular weight of DNA after treatment with a restriction enzyme
that cleaves at several sites.

The fractions determined to have no deoxyribonuclease activity are pooled and dialyzed against the same buffer
used in the third step.

In the sixth step, the pooled fractions are placed on a phosphocellulose column with a set bed volume. The column
is washed and the enzyme eluted with a linear gradient of a buffer such as 0.01 to 0.4 M KCl in a potassium phosphate
buffer at pH 7.5. The pooled fractions having thermostable polymerase activity and no deoxyribonuclease activity are
dialyzed against a buffer at pH 8.0.

The molecular weight of the dialyzed product may be determined by any technique, for example, by SDS-PAGE
using protein molecular weight markers. The molecular weight of one of the preferred enzymes herein, the DNA
polymerase purified from Thermus aquaticus, is determined by the above method to be about 86,000-90,000 daltons.

The thermostable enzyme of this invention may also be produced by recombinant DNA techniques, as the gene
encoding this enzyme has been cloned from Thermus aquaticus genomic DNA. The complete coding sequence for
the Thermus aquaticus (Taq) polymerase can be derived from bacteriophage CH35:Taq#4-2 on an approximately 3.5
kilobase (kb) BglII-Asp718 (partial) restriction fragment contained within an ~18 kb genomic DNA insert fragment. This
bacteriophage was deposited with the American Type Culture Collection (ATCC) on May 29, 1987 and has accession
no. 40,336. Alternatively, the gene can be constructed by ligating an ~750 base pair (bp) BglII-HindIII restriction
fragment isolated from plasmid pFC83 (ATCC 67,422, deposited May 29, 1987) to an ~2.8 kb HindIII-Asp718 restriction
fragment isolated from plasmid pFC85 (ATCC 67,421, deposited May 29, 1987). The pFC83 restriction fragment
comprises the amino-terminus of the Taq polymerase gene while the restriction fragment from pFC85 comprises the
carboxyl-terminus. Thus, ligation of these two fragments into a correspondingly digested vector with appropriate control
sequences will result in the translation of a full-length Taq polymerase.

It has also been found that the entire coding sequence of the Taq polymerase gene is not required to recover a
biologically active gene product with the desired enzymatic activity. Amino-terminal deletions wherein approximately
one-third of the coding sequence is absent have resulted in producing a gene product that is quite active in polymerase
assays.

In addition to the N-terminal deletions, individual amino acid residues in the peptide chain comprising Taq
polymerase may be modified by oxidation, reduction, or other derivatization, and the protein may be cleaved to obtain
fragments that retain activity. Such alterations that do not destroy activity do not remove the DNA sequence encoding
such protein from the definition of gene.

Thus, modifications to the primary structure itself by deletion, addition, or alteration of the amino acids incorporated
into the sequence during translation can be made without destroying the activity of the protein. Such substitutions or
other alterations result in proteins having an amino acid sequence encoded by DNA falling within the contemplated
scope of the present invention.

Polyclonal antiserum from rabbits immunized with the purified 86,000-90,000 dalton polymerase of this invention
was used to probe a Thermus aquaticus partial genomic expression library to obtain the appropriate coding sequence
as described below. The cloned genomic sequence can be expressed as a fusion polypeptide, expressed directly using
its own control sequences, or expressed by constructions using control sequences appropriate to the particular host
used for expression of the enzyme.

Of course, the availability of DNA encoding these sequences provides the opportunity to modify the codon
sequence so as to generate mutein forms also having DNA polymerase activity.

Thus, these tools can provide the complete coding sequence for Taq DNA polymerase from which expression
vectors applicable to a variety of host systems can be constructed and the coding sequence expressed. It is also
evident from the foregoing that portions of the Taq polymerase-encoding sequence are useful as probes to retrieve
other thermostable polymerase-encoding sequences in a variety of species. Accordingly, portions of the genomic DNA
encoding at least six amino acids can be replicated in E. coli and the denatured forms used as probes, or
oligodeoxyribonucleotide probes can be synthesized which encode at least six amino acids and used to retrieve
additional DNAs encoding a thermostable polymerase. Because there may not be a precisely exact match between the
nucleotide sequence in the Thermus aquaticus form and that in the corresponding portion of other species, oligomers
containing approximately 18 nucleotides (encoding the six amino acid stretch) are probably necessary to obtain
hybridization under conditions of sufficient stringency to eliminate false positives. The sequences encoding six amino
acids would supply information sufficient for such probes.
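
As an illustration of the arithmetic behind the 18-nucleotide figure, the following minimal sketch computes the length of an oligomer encoding a given peptide stretch and the number of distinct coding sequences allowed by the standard genetic code. The function names and the six-residue peptide "MWKDEF" are hypothetical and chosen only for the example; they do not come from the Taq polymerase sequence.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid codes).
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}

def probe_length(peptide):
    """Nucleotides needed to encode the peptide (three per residue)."""
    return 3 * len(peptide)

def probe_degeneracy(peptide):
    """Number of distinct nucleotide sequences that encode the peptide."""
    return prod(CODON_COUNT[aa] for aa in peptide)

# Hypothetical six-residue stretch, used only for illustration.
example = "MWKDEF"
print(probe_length(example), probe_degeneracy(example))
```

A stretch rich in residues with one or two codons keeps the number of candidate sequences small, which is one reason such stretches tend to be convenient when designing probes.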

Suitable Hosts, Control Systems and Methods 

In general terms, the production of a recombinant form of Taq polymerase typically involves the following:

First, a DNA is obtained that encodes the mature (used here to include all muteins) enzyme or a fusion of the Taq
polymerase to an additional sequence that does not destroy its activity or to an additional sequence cleavable under
controlled conditions (such as treatment with peptidase) to give an active protein. If the sequence is uninterrupted by
introns it is suitable for expression in any host. This sequence should be in an excisable and recoverable form.

The excised or recovered coding sequence is then preferably placed in operable linkage with suitable control
sequences in a replicable expression vector. The vector is used to transform a suitable host and the transformed host
cultured under favorable conditions to effect the production of the recombinant Taq polymerase. Optionally the Taq
polymerase is isolated from the medium or from the cells; recovery and purification of the protein may not be necessary
in some instances where some impurities may be tolerated.

Each of the foregoing steps can be done in a variety of ways. For example, the desired coding sequences may
be obtained from genomic fragments and used directly in appropriate hosts. The constructions for expression vectors
operable in a variety of hosts are made using appropriate replicons and control sequences, as set forth below. Suitable
restriction sites can, if not normally available, be added to the ends of the coding sequence so as to provide an excisable
gene to insert into these vectors.

The control sequences, expression vectors, and transformation methods are dependent on the type of host cell
used to express the gene. Generally, procaryotic, yeast, insect, or mammalian cells are presently useful as hosts.
Procaryotic hosts are in general the most efficient and convenient for the production of recombinant proteins and
therefore preferred for the expression of Taq polymerase.

In the particular case of Taq polymerase, evidence indicates that considerable deletion at the N-terminus of the
protein may occur under both recombinant and native conditions, and that the activity of the protein is still retained. It
appears that the native proteins isolated may be the result of proteolytic degradation, and not translation of a truncated
gene. The mutein produced from the truncated gene of plasmid pFC85 is, however, fully active in assays for DNA
polymerase, as is that produced from DNA encoding the full-length sequence. Since it is clear that certain N-terminal
shortened forms are active, the gene constructs used for expression of the polymerase may also include the
corresponding shortened forms of the coding sequence.

Control Sequences and Corresponding Hosts

Procaryotes most frequently are represented by various strains of E. coli. However, other microbial strains may
also be used, such as bacilli, for example, Bacillus subtilis, various species of Pseudomonas, or other bacterial strains.
In such procaryotic systems, plasmid vectors that contain replication sites and control sequences derived from a species
compatible with the host are used. For example, E. coli is typically transformed using derivatives of pBR322, a plasmid
derived from an E. coli species by Bolivar et al., Gene (1977) 2:95. pBR322 contains genes for ampicillin and
tetracycline resistance, and thus provides additional markers that can be either retained or destroyed in constructing the
desired vector. Commonly used procaryotic control sequences, which are defined herein to include promoters for
transcription initiation, optionally with an operator, along with ribosome binding site sequences, include such commonly
used promoters as the β-lactamase (penicillinase) and lactose (lac) promoter systems (Chang, et al., Nature (1977)
198:1056), the tryptophan (trp) promoter system (Goeddel, et al., Nucleic Acids Res. (1980) 8:4057) and the
lambda-derived PL promoter (Shimatake, et al., Nature (1981) 292:128) and N-gene ribosome binding site, which has been
made useful as a portable control cassette which comprises a first DNA sequence that is the PL promoter operably
linked to a second DNA sequence corresponding to NRBS upstream of a third DNA sequence having at least one
restriction site that permits cleavage within six bp 3' of the NRBS sequence. Also useful is the phosphatase A (phoA)
system described by Chang, et al. in European Patent Publication No. 196,864 published October 8, 1986, assigned
to the same assignee. However, any available promoter system compatible with procaryotes can be used.

In addition to bacteria, eucaryotic microbes, such as yeast, may also be used as hosts. Laboratory strains of
Saccharomyces cerevisiae, Baker's yeast, are most used, although a number of other strains are commonly available.
While vectors employing the 2 micron origin of replication are illustrated (Broach, J. R., Meth. Enz. (1983) 101:307),
other plasmid vectors suitable for yeast expression are known (see, for example, Stinchcomb, et al., Nature (1979)
282:39, Tschempe, et al., Gene (1980) 10:157 and Clarke, L., et al., Meth. Enz. (1983) 101:300). Control sequences
for yeast vectors include promoters for the synthesis of glycolytic enzymes (Hess, et al., J. Adv. Enzyme Reg. (1968)
7:149; Holland, et al., Biotechnology (1978) 17:4900).

Additional promoters known in the art include the promoter for 3-phosphoglycerate kinase (Hitzeman, et al., J.
Biol. Chem. (1980) 255:2073), and those for other glycolytic enzymes, such as glyceraldehyde-3-phosphate
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase,
3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase.
Other promoters that have the additional advantage of transcription controlled by growth conditions are the promoter
regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen
metabolism, and enzymes responsible for maltose and galactose utilization (Holland, supra).

It is also believed that terminator sequences are desirable at the 3' end of the coding sequences. Such terminators
are found in the 3' untranslated region following the coding sequences in yeast-derived genes. Many of the vectors
illustrated contain control sequences derived from the enolase gene containing plasmid peno46 (Holland, M. J., et al.,
J. Biol. Chem. (1981) 256:1385) or the LEU2 gene obtained from YEp13 (Broach, J., et al., Gene (1978) 8:121);
however, any vector containing a yeast-compatible promoter, origin of replication, and other control sequences is suitable.

It is also, of course, possible to express genes encoding polypeptides in eucaryotic host cell cultures derived from
multicellular organisms. See, for example, Tissue Culture, Academic Press, Cruz and Patterson, editors (1973). Useful
host cell lines include murine myelomas N51, VERO and HeLa cells, and Chinese hamster ovary (CHO) cells.
Expression vectors for such cells ordinarily include promoters and control sequences compatible with mammalian cells
such as, for example, the commonly used early and late promoters from Simian Virus 40 (SV 40) (Fiers, et al., Nature
(1978) 273:113), or other viral promoters such as those derived from polyoma, Adenovirus 2, bovine papilloma virus, or
avian sarcoma viruses, or immunoglobulin promoters and heat shock promoters. A system for expressing DNA in
mammalian systems using the BPV as a vector is disclosed in U.S. Patent 4,419,446. A modification of this system is
described in U.S. Patent 4,601,978. General aspects of mammalian cell host system transformations have been described
by Axel, U.S. Patent No. 4,399,216. It now appears, also, that "enhancer" regions are important in optimizing expression;
these are, generally, sequences found upstream of the promoter region. Origins of replication may be obtained, if
needed, from viral sources. However, integration into the chromosome is a common mechanism for DNA replication
in eucaryotes.

Plant cells are also now available as hosts, and control sequences compatible with plant cells such as the nopaline
synthase promoter and polyadenylation signal sequences (Depicker, A., et al., J. Mol. Appl. Gen. (1982) 1:561) are
available.

Recently, in addition, expression systems employing insect cells utilizing the control systems provided by
baculovirus vectors have been described (Miller, D. W., et al., in Genetic Engineering (1986) Setlow, J. K. et al., eds.,
Plenum Publishing, Vol. 6, pp. 277-297). These systems are also successful in producing Taq polymerase.

Transformations

Depending on the host cell used, transformation is done using standard techniques appropriate to such cells. The
calcium treatment employing calcium chloride, as described by Cohen, S. N., Proc. Natl. Acad. Sci. (USA) (1972)
69:2110, is used for procaryotes or other cells that contain substantial cell wall barriers. Infection with Agrobacterium
tumefaciens (Shaw, C. H., et al., Gene (1983) 23:315) is used for certain plant cells. For mammalian cells without such
cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology (1973) 52:546 is preferred.
Transformations into yeast are carried out according to the method of Van Solingen, P., et al., J. Bact. (1977) 130:946
and Hsiao, C. L., et al., Proc. Natl. Acad. Sci. (USA) (1979) 76:3829.

Construction of a λgt11 Expression Library

The strategy for isolating DNA encoding desired proteins, such as the Taq polymerase encoding DNA, using the
bacteriophage vector lambda gt11, is as follows. A library can be constructed of EcoRI-flanked AluI fragments,
generated by complete digestion of Thermus aquaticus DNA, inserted at the EcoRI site in the lambda gt11 phage (Young
and Davis, Proc. Natl. Acad. Sci. USA (1983) 80:1194-1198). Because the unique EcoRI site in this bacteriophage is
located in the carboxyl-terminus of the β-galactosidase gene, inserted DNA (in the appropriate frame and orientation)
is expressed as protein fused with β-galactosidase under the control of the lactose operon promoter/operator.

Genomic expression libraries are then screened using the antibody plaque hybridization procedure. A modification
of this procedure, referred to as "epitope selection," uses antiserum against the fusion protein sequence encoded by
the phage, to confirm the identification of hybridized plaques. Thus, this library of recombinant phages could be
screened with antibodies that recognize the 86,000-90,000 dalton Taq polymerase in order to identify phage that carry
DNA segments encoding the antigenic determinants of this protein.

Approximately 2 x 10^5 recombinant phage are screened using total rabbit Taq polymerase antiserum. In this primary
screen, positive signals are detected, and one or more of these plaques are purified from candidate plaques which
failed to react with preimmune serum and reacted with immune serum, and analyzed in some detail. To examine the
fusion proteins produced by the recombinant phage, lysogens of the phage in the host Y1089 are produced. Upon
induction of the lysogens and gel electrophoresis of the resulting proteins, each lysogen may be observed to produce
a new protein, not found in the other lysogens, or duplicate sequences may result. Phage containing positive signals
are picked; in this case, one positive plaque was picked for further identification and replated at lower densities to purify
recombinants, and the purified clones were analyzed by size class via digestion with EcoRI restriction enzyme. Probes
can then be made of the isolated DNA insert sequences and labeled appropriately, and these probes can be used in
conventional colony or plaque hybridization assays described in Maniatis et al., Molecular Cloning: A Laboratory Manual
(1982).

The labeled probe was used to probe a second genomic library constructed in a Charon 35 bacteriophage
(Wilhelmine, A. M. et al., Gene (1983) 26:171-179). This library was made from Sau3A partial digestions of genomic
Thermus aquaticus DNA, and size fractionated fragments (15-20 kb) were cloned into the BamHI site of the Charon 35
phage. The probe was used to isolate phage containing DNA encoding the Taq polymerase. One of the resulting phage,
designated CH35:Taq#4-2, was found to contain the entire gene sequence. Partial sequences encoding portions of
the protein were also isolated.

Vector Construction 

Construction of suitable vectors containing the desired coding and control sequences employs standard ligation
and restriction techniques that are well understood in the art. Isolated plasmids, DNA sequences, or synthesized
oligonucleotides are cleaved, tailored, and religated in the form desired.

Site-specific DNA cleavage is performed by treating with the suitable restriction enzyme (or enzymes) under
conditions that are generally understood in the art, and the particulars of which are specified by the manufacturer of these
commercially available restriction enzymes. See, e.g., New England Biolabs, Product Catalog. In general, about 1 µg
of plasmid or DNA sequence is cleaved by one unit of enzyme in about 20 µl of buffer solution; in the examples herein,
typically an excess of restriction enzyme is used to ensure complete digestion of the DNA substrate. Incubation times
of about one hour to two hours at about 37°C are workable, although variations can be tolerated. After each incubation,
protein is removed by extraction with phenol/chloroform, and may be followed by ether extraction, and the nucleic acid
recovered from aqueous fractions by precipitation with ethanol. If desired, size separation of the cleaved fragments
may be performed by polyacrylamide gel or agarose gel electrophoresis using standard techniques. A general
description of size separations is found in Methods in Enzymology (1980) 65:499-560.

Restriction-cleaved fragments may be blunt-ended by treating with the large fragment of E. coli DNA polymerase
I (Klenow) in the presence of the four deoxynucleotide triphosphates (dNTPs) using incubation times of about 15 to
25 minutes at 20 to 25°C in 50 mM Tris pH 7.6, 50 mM NaCl, 10 mM MgCl2, 10 mM DTT and 50-100 µM dNTPs. The
Klenow fragment fills in at 5' sticky ends, but chews back protruding 3' single strands, even though the four dNTPs are
present. If desired, selective repair can be performed by supplying only one of the, or selected, dNTPs within the
limitations dictated by the nature of the sticky ends. After treatment with Klenow, the mixture is extracted with
phenol/chloroform and ethanol precipitated. Treatment under appropriate conditions with S1 nuclease results in
hydrolysis of any single-stranded portion.

Synthetic oligonucleotides may be prepared using the triester method of Matteucci, et al., (J. Am. Chem. Soc.
(1981) 103:3185-3191) or using automated synthesis methods. Kinasing of single strands prior to annealing or for
labeling is achieved using an excess, e.g., approximately 10 units of polynucleotide kinase to 1 nM substrate in the
presence of 50 mM Tris, pH 7.6, 10 mM MgCl2, 5 mM dithiothreitol, 1-2 mM ATP. If kinasing is for labeling of probe,
the ATP will contain high specific activity γ-32P.

Ligations are performed in 15-30 µl volumes under the following standard conditions and temperatures: 20 mM
Tris-Cl pH 7.5, 10 mM MgCl2, 10 mM DTT, 33 µg/ml BSA, 10 mM-50 mM NaCl, and either 40 µM ATP, 0.01-0.02 (Weiss)
units T4 DNA ligase at 0°C (for "sticky end" ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA ligase at 14°C (for
"blunt end" ligation). Intermolecular "sticky end" ligations are usually performed at 33-100 µg/ml total DNA
concentrations (5-100 nM total end concentration). Intermolecular blunt end ligations (usually employing a 10-30 fold molar
excess of linkers) are performed at 1 µM total ends concentration.
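
The relation between a mass concentration of DNA and its total end concentration can be sketched as follows. This is only an illustrative calculation: it assumes linear double-stranded fragments with two ends each and an average mass of 660 g/mol per base pair, a conventional approximation that is not stated in the text. The function name and the fragment lengths are hypothetical.

```python
# Assumed average molar mass of one base pair of double-stranded DNA (g/mol).
AVG_BP_MASS = 660.0

def end_concentration_nM(dna_ug_per_ml, fragment_bp):
    """Total concentration of fragment ends (nM) for linear duplex DNA.

    dna_ug_per_ml: total DNA concentration in micrograms per milliliter
    fragment_bp:   fragment length in base pairs (assumed uniform)
    """
    grams_per_liter = dna_ug_per_ml * 1e-3           # ug/ml equals mg/l
    molar_fragments = grams_per_liter / (AVG_BP_MASS * fragment_bp)
    return 2 * molar_fragments * 1e9                 # two ends per fragment

# Hypothetical fragment sizes chosen to bracket the stated range.
print(end_concentration_nM(33, 10000))   # about 10 nM ends
print(end_concentration_nM(100, 3000))   # about 100 nM ends
```

Under these assumptions, 33-100 µg/ml of fragments a few kilobases long falls within the 5-100 nM range of total ends quoted above.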

In vector construction employing "vector fragments", the vector fragment is commonly treated with bacterial alkaline
phosphatase (BAP) in order to remove the 5' phosphate and prevent religation of the vector. BAP digestions are
conducted at pH 8 in approximately 150 mM Tris, in the presence of Na+ and Mg2+ using about 1 unit of BAP per mg of
vector at 60°C for about one hour. In order to recover the nucleic acid fragments, the preparation is extracted with
phenol/chloroform and ethanol precipitated. Alternatively, religation can be prevented in vectors that have been double
digested by additional restriction enzyme digestion of the unwanted fragments.

Modification of DNA Sequences

For portions of vectors derived from cDNA or genomic DNA that require sequence modifications, site-specific
primer-directed mutagenesis is used. This technique is now standard in the art and is conducted using a primer
synthetic oligonucleotide complementary to a single-stranded phage DNA to be mutagenized except for limited
mismatching, representing the desired mutation. Briefly, the synthetic oligonucleotide is used as a primer to direct
synthesis of a strand complementary to the phage, and the resulting double-stranded DNA is transformed into a
phage-supporting host bacterium. Cultures of the transformed bacteria are plated in top agar, permitting plaque formation
from single cells that harbor the phage.

Theoretically, 50% of the new plaques will contain the phage having, as a single strand, the mutated form; 50%
will have the original sequence. The plaques are transferred to nitrocellulose filters and the "lifts" hybridized with kinased
synthetic primer at a temperature that permits hybridization of an exact match, but at which the mismatches with the
original strand are sufficient to prevent hybridization. Plaques that hybridize with the probe are then picked and cultured,
and the DNA is recovered.
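
The theoretical 50% figure can be turned into a rough estimate of how many plaques need to be screened. The sketch below assumes each plaque independently carries the mutated form with a fixed probability; the function name and the default of 0.5 are illustrative, and yields in practice may differ from the theoretical value.

```python
def prob_at_least_one_mutant(plaques_screened, mutant_fraction=0.5):
    """Probability that at least one screened plaque carries the mutation,
    assuming independent plaques with the given mutant fraction."""
    return 1.0 - (1.0 - mutant_fraction) ** plaques_screened

# With the theoretical 50% yield, a handful of plaques is usually enough.
for n in (1, 4, 10):
    print(n, prob_at_least_one_mutant(n))
```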

Verification of Construction

In the constructions set forth below, correct ligations for plasmid construction are confirmed by first transforming
E. coli strain MM294, or other suitable host, with the ligation mixture. Successful transformants are selected by
ampicillin, tetracycline or other antibiotic resistance or using other markers, depending on the mode of plasmid
construction, as is understood in the art. Plasmids from the transformants are then prepared according to the method of
Clewell, D. B., et al., Proc. Natl. Acad. Sci. (USA) (1969) 62:1159, optionally following chloramphenicol amplification
(Clewell, D. B., J. Bacteriol. (1972) 110:667). The isolated DNA is analyzed by restriction and/or sequenced by the
dideoxy method of Sanger, F., et al., Proc. Natl. Acad. Sci. (USA) (1977) 74:5463 as further described by Messing, et al.,
Nucleic Acids Res. (1981) 9:309, or by the method of Maxam, et al., Methods in Enzymology (1980) 65:499.

Host Exemplified

Host strains used in cloning and expression herein are as follows:

For cloning and sequencing, and for expression of constructions under control of most bacterial promoters, E. coli
strain MM294, obtained from E. coli Genetic Stock Center, GCSC #6135, was used as the host. For expression under
control of the PLNRBS promoter, E. coli strain K12 MC1000 lambda lysogen, N7N53cI857 SusP80, ATCC 39531, may
be used. Used herein is E. coli DG116, which was deposited with ATCC (ATCC 53606) on April 7, 1987.

For M13 phage recombinants, E. coli strains susceptible to phage infection, such as E. coli K12 strain DG98, are
employed. The DG98 strain was deposited with ATCC on July 13, 1984 and has accession number 39768.

Mammalian expression can be accomplished in COS-7, COS-A2, CV-1, and murine cells, and insect cell-based
expression in Spodoptera frugiperda.

Stabilization of Enzyme Activity

For long-term stability, the enzyme herein must be stored in a buffer that contains one or more non-ionic polymeric
detergents. Such detergents are generally those that have a molecular weight in the range of approximately 100 to
250,000, preferably about 4,000 to 200,000 daltons, and stabilize the enzyme at a pH of from about 3.5 to about 9.5,
preferably from about 4 to 8.5. Examples of such detergents include those specified on pages 295-298 of McCutcheon's
Emulsifiers & Detergents, North American edition (1983), published by the McCutcheon Division of MC Publishing Co.,
175 Rock Road, Glen Rock, NJ (USA), the entire disclosure of which is incorporated herein by reference. Preferably,
the detergents are selected from the group comprising ethoxylated fatty alcohol ethers and lauryl ethers, ethoxylated
alkyl phenols, octylphenoxy polyethoxy ethanol compounds, modified oxyethylated and/or oxypropylated straight-chain
alcohols, polyethylene glycol monooleate compounds, polysorbate compounds, and phenolic fatty alcohol ethers. More
particularly preferred are Tween 20, from ICI Americas Inc., Wilmington, DE, which is a polyoxyethylated (20) sorbitan
monolaurate, and Iconol™ NP-40, from BASF Wyandotte Corp., Parsippany, NJ, which is an ethoxylated alkyl phenol
(nonyl).

The thermostable enzyme of this invention may be used for any purpose in which such enzyme is necessary or
desirable. In a particularly preferred embodiment, the enzyme herein is employed in the amplification protocol set forth
below.

Amplification Protocol 

The amplification protocol using the enzyme of the present invention may be the process for amplifying existing
nucleic acid sequences that is disclosed and claimed in European Pat. Pub. No. 200,362, supra. Preferably, however,
the enzyme herein is used in the amplification process described below.

In general, the amplification process involves a chain reaction for producing, in exponential quantities relative to
the number of reaction steps involved, at least one specific nucleic acid sequence given (a) that the ends of the required
sequence are known in sufficient detail that oligonucleotides can be synthesized which will hybridize to them, and (b)
that a small amount of the sequence is available to initiate the chain reaction. The product of the chain reaction will be
a discrete nucleic acid duplex with termini corresponding to the ends of the specific primers employed.
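
The exponential relationship between the number of reaction steps and the amount of product can be illustrated with a simple model. The sketch below assumes that each cycle multiplies the number of target copies by (1 + efficiency), where an efficiency of 1.0 corresponds to ideal doubling; the per-cycle efficiency parameter and the function name are assumptions introduced for illustration and are not part of the process as described.

```python
def copies_after(cycles, start_copies=1, efficiency=1.0):
    """Idealized number of target copies after a given number of cycles.

    efficiency is the assumed fraction of templates copied per cycle
    (1.0 means every template is duplicated in every cycle).
    """
    return start_copies * (1.0 + efficiency) ** cycles

# Ideal doubling compared with a hypothetical 85% per-cycle efficiency.
for n in (10, 20, 30):
    print(n, copies_after(n), copies_after(n, efficiency=0.85))
```

Even a modest shortfall in per-cycle efficiency compounds over many cycles, which is why the yield after 20 to 30 cycles is sensitive to reaction conditions.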

Any nucleic acid sequence, in purified or nonpurified form, can be utilized as the starting nucleic acid(s), provided
it contains or is suspected to contain the specific nucleic acid sequence desired. Thus, the process may employ, for
example, DNA or RNA, including messenger RNA, which DNA or RNA may be single-stranded or double-stranded. In
addition, a DNA-RNA hybrid which contains one strand of each may be utilized. A mixture of any of these nucleic acids
may also be employed, or the nucleic acids produced from a previous amplification reaction herein using the same or
different primers may be so utilized. The specific nucleic acid sequence to be amplified may be only a fraction of a
larger molecule or can be present initially as a discrete molecule, so that the specific sequence constitutes the entire
nucleic acid.

It is not necessary that the sequence to be amplified be present initially in a pure form; it may be a minor fraction
of a complex mixture, such as a portion of the β-globin gene contained in whole human DNA (as exemplified in Saiki
et al., Science, 230, 1530-1534 (1985)) or a portion of a nucleic acid sequence due to a particular microorganism which
organism might constitute only a very minor fraction of a particular biological sample. The starting nucleic acid sequence
may contain more than one desired specific nucleic acid sequence which may be the same or different. Therefore, the
amplification process is useful not only for producing large amounts of one specific nucleic acid sequence, but also for
amplifying simultaneously more than one different specific nucleic acid sequence located on the same or different
nucleic acid molecules.

The nucleic acid(s) may be obtained from any source, for example, from plasmids such as pBR322, from cloned
DNA or RNA, or from natural DNA or RNA from any source, including bacteria, yeast, viruses, organelles, and higher
organisms such as plants or animals. DNA or RNA may be extracted from blood, tissue material such as chorionic villi,
or amniotic cells by a variety of techniques such as that described by Maniatis et al., supra, p. 280-281.

If probes are used which are specific to a sequence being amplified and thereafter detected, the cells may be
directly used without extraction of the nucleic acid if they are suspended in hypotonic buffer and heated to about
90-100°C, until cell lysis and dispersion of intracellular components occur, generally 1 to 15 minutes. After the heating
step the amplification reagents may be added directly to the lysed cells.

Any specific nucleic acid sequence can be produced by the amplification process. It is only necessary that a
sufficient number of bases at both ends of the sequence be known in sufficient detail so that two oligonucleotide primers
can be prepared which will hybridize to different strands of the desired sequence and at relative positions along the
sequence such that an extension product synthesized from one primer, when it is separated from its template
(complement), can serve as a template for extension of the other primer into a nucleic acid sequence of defined length. The
greater the knowledge about the bases at both ends of the sequence, the greater can be the specificity of the primers
for the target nucleic acid sequence, and thus the greater the efficiency of the process.

It will be understood that the word "primer" as used hereinafter may refer to more than one primer, particularly in
the case where there is some ambiguity in the information regarding the terminal sequence(s) of the fragment to be
amplified. For instance, in the case where a nucleic acid sequence is inferred from protein sequence information, a
collection of primers containing sequences representing all possible codon variations based on degeneracy of the
genetic code will be used for each strand. One primer from this collection will be homologous with the end of the desired
sequence to be amplified.
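
The collection of primers described above can be enumerated mechanically. The sketch below is illustrative only: the peptide "MKWDF" and the function name are assumptions, and the codon table is deliberately limited to the five amino acids used in the example (the codons listed are those of the standard genetic code).

```python
from itertools import product

# Partial codon table (DNA, sense strand) covering only the amino acids
# in the assumed example peptide below.
CODONS = {
    "M": ["ATG"],
    "W": ["TGG"],
    "K": ["AAA", "AAG"],
    "D": ["GAT", "GAC"],
    "F": ["TTT", "TTC"],
}


def degenerate_primers(peptide: str) -> list:
    """Return every DNA sequence that could encode the given peptide."""
    codon_choices = [CODONS[aa] for aa in peptide]
    return ["".join(codons) for codons in product(*codon_choices)]


# Hypothetical five-residue peptide chosen for low degeneracy.
pool = degenerate_primers("MKWDF")
print(len(pool))
for primer in pool:
    print(primer)
```

The size of the pool is the product of the codon counts for each residue, which is why stretches rich in methionine and tryptophan (one codon each) are convenient when primers are inferred from protein sequence.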

The oligonucleotide primers may be prepared using any suitable method, such as, for example, the phosphotriester
and phosphodiester methods described above or automated embodiments thereof. In one such automated embodiment,
diethylphosphoramidites are used as starting materials and may be synthesized as described by Beaucage et
al., Tetrahedron Letters (1981), 22:1859-1862. One method for synthesizing oligonucleotides on a modified solid
support is described in U.S. Patent No. 4,458,066. It is also possible to use a primer which has been isolated from a
biological source (such as a restriction endonuclease digest).

The specific nucleic acid sequence is produced by using the nucleic acid containing that sequence as a template.
The first step involves contacting each nucleic acid strand with four different nucleotide triphosphates and one
oligonucleotide primer for each different nucleic acid sequence being amplified or detected. If the nucleic acids to be
amplified or detected are DNA, then the nucleotide triphosphates are dATP, dCTP, dGTP and TTP.

The nucleic acid strands are used as a template for the synthesis of additional nucleic acid strands. This synthesis
can be performed using any suitable method. Generally it occurs in a buffered aqueous solution, preferably at a pH of
7-9, most preferably about 8. Preferably, a molar excess (for cloned nucleic acid, usually about 1000:1 primer:template,
and for genomic nucleic acid, usually about 10^6:1 primer:template) of the two oligonucleotide primers is added to the
buffer containing the separated template strands. It is understood, however, that the amount of complementary strand
may not be known if the process herein is used for diagnostic applications, so that the amount of primer relative to the
amount of complementary strand cannot be determined with certainty. As a practical matter, however, the amount of
primer added will generally be in molar excess over the amount of complementary strand (template) when the sequence
to be amplified is contained in a mixture of complicated long-chain nucleic acid strands. A large molar excess is preferred
to improve the efficiency of the process.

Preferably the concentration of nucleotide triphosphates is 150-200 µM each in the buffer for amplification, and
MgCl2 is present in the buffer in an amount of 1.5-2 mM to increase the efficiency and specificity of the reaction.

The resulting solution is then treated according to whether the nucleic acids being amplified or detected are double-
or single-stranded. If the nucleic acids are single-stranded, then no denaturation step need be employed, and the
reaction mixture is held at a temperature which promotes hybridization of the primer to its complementary target
(template) sequence. Such temperature is generally from about 35°C to 65°C or more, preferably about 37-60°C, for an
effective time, generally one-half to five minutes, preferably one to three minutes. Preferably, 45-58°C is used for Taq
polymerase and >15-mer primers to increase the specificity of primer hybridization. Shorter primers need lower
temperatures.

The complement to the original single-stranded nucleic acid may be synthesized by adding one or two oligonucleotide
primers thereto. If an appropriate single primer is added, a primer extension product is synthesized in the presence
of the primer, the thermostable enzyme and the nucleotide triphosphates. The product will be partially complementary
to the single-stranded nucleic acid and will hybridize with the nucleic acid strand to form a duplex of strands of unequal
length which may then be separated into single strands as described above to produce two single separated
complementary strands. Alternatively, two appropriate primers may be added to the single-stranded nucleic acid and the
reaction carried out.

If the nucleic acid contains two strands, it is necessary to separate the strands of the nucleic acid before it can be
used as the template. This strand separation can be accomplished by any suitable denaturing method including
physical, chemical or enzymatic means. One preferred physical method of separating the strands of the nucleic acid involves
heating the nucleic acid until it is completely (>99%) denatured. Typical heat denaturation involves temperatures
ranging from about 90 to 105°C for times generally ranging from about 0.5 to 5 minutes. Preferably the effective denaturing
temperature is 90-100°C for 0.5 to 3 minutes. Strand separation may also be induced by an enzyme from the class of
enzymes known as helicases or the enzyme RecA, which has helicase activity and in the presence of riboATP is known
to denature DNA. The reaction conditions suitable for separating the strands of nucleic acids with helicases are
described by Kuhn, B., Abdel-Monem, M. and Hoffmann-Berling, K., CSH-Quantitative Biology, 43:63-67 (1978), and
techniques for using RecA are reviewed in C. Radding, Ann. Rev. Genetics, 16:405-37 (1982). The denaturation
produces two separated complementary strands of equal or unequal length.

If the double-stranded nucleic acid is denatured by heat, the reaction mixture is allowed to cool to a temperature
which promotes hybridization of each primer present to its complementary target (template) sequence. This
temperature is usually from about 35°C to 65°C or more, depending on reagents, preferably 37-60°C, maintained for an effective
time, generally 0.5 to 5 minutes, and preferably 1-3 minutes. In practical terms, the temperature is simply lowered from
about 95°C to as low as 37°C, preferably to about 45-58°C for Taq polymerase, and hybridization occurs at a
temperature within this range.

Whether the nucleic acid is single- or double-stranded, the thermostable enzyme may be added at the denaturation
step or when the temperature is being reduced to or is in the range for promoting hybridization. The reaction mixture
is then heated to a temperature at which the activity of the enzyme is promoted or optimized, i.e., a temperature
sufficient to increase the activity of the enzyme in facilitating synthesis of the primer extension products from the
hybridized primer and template. The temperature must actually be sufficient to synthesize an extension product of each
primer which is complementary to each nucleic acid template, but must not be so high as to denature each extension
product from its complementary template (i.e., the temperature is generally less than about 80°C-90°C).

Depending mainly on the types of enzyme and nucleic acid(s) employed, the typical temperature effective for this
synthesis reaction generally ranges from about 40 to 80°C, preferably 50-75°C. The temperature more preferably
ranges from about 65-75°C when a DNA polymerase from Thermus aquaticus is employed. The period of time required
for this synthesis may range from about 0.5 to 40 minutes or more, depending mainly on the temperature, the length
of the nucleic acid, the enzyme and the complexity of the nucleic acid mixture, preferably one to three minutes. If the
nucleic acid is longer, a longer time period is generally required. The presence of dimethylsulfoxide (DMSO) is not
necessary or recommended because DMSO was found to inhibit Taq polymerase enzyme activity.

The newly synthesized strand and its complementary nucleic acid strand form a double-stranded molecule which
is used in the succeeding steps of the process. In the next step, the strands of the double-stranded molecule are
separated by heat denaturation at a temperature effective to denature the molecule, but not so high that the
thermostable enzyme is completely and irreversibly denatured or inactivated. Depending mainly on the type of enzyme and the
length of nucleic acid, this temperature generally ranges from about 90 to 105°C, more preferably 90-100°C, and the
time for denaturation typically ranges from 0.5 to four minutes, depending mainly on the temperature and nucleic acid
length.

After this time, the temperature is decreased to a level which promotes hybridization of the primer to its
complementary single-stranded molecule (template) produced from the previous step. Such temperature is described above.

After this hybridization step, or in lieu of (or concurrently with) the hybridization step, the temperature is adjusted
to a temperature that is effective to promote the activity of the thermostable enzyme to enable synthesis of a primer
extension product using as template the newly synthesized strand from the previous step. The temperature again must
not be so high as to separate (denature) the extension product from its template, as previously described (usually from
40 to 80°C for 0.5 to 40 minutes, preferably 50 to 70°C for one to three minutes). Hybridization may occur during this
step, so that the previous step of cooling after denaturation is not required. In such a case, using simultaneous steps,
the preferred temperature range is 50-70°C.

The heating and cooling steps of strand separation, hybridization, and extension product synthesis can be repeated
as often as needed to produce the desired quantity of the specific nucleic acid sequence, depending on the ultimate
use. The only limitation is the amount of the primers, thermostable enzyme and nucleotide triphosphates present.
Preferably, the steps are repeated at least twice. For use in detection, the number of cycles will depend, e.g., on the
nature of the sample. For example, fewer cycles will be required if the sample being amplified is pure. If the sample is
a complex mixture of nucleic acids, more cycles will be required to amplify the signal sufficiently for its detection. For
general amplification and detection, preferably the process is repeated at least 20 times.

When labeled sequence-specific probes are employed as described below, preferably the steps are repeated at
least five times. When human genomic DNA is employed with such probes, the process is repeated preferably 15-30
times to amplify the sequence sufficiently that a clearly detectable signal is produced, i.e., so that background noise
does not interfere with detection.

As will be described in further detail below, the amount of the specific nucleic acid sequence produced will
accumulate in an exponential fashion.

No additional nucleotides, primers or thermostable enzyme need be added after the initial addition, provided that
the enzyme has not become denatured or inactivated irreversibly, in which case it is necessary to replenish the enzyme
after each denaturing step. Addition of such materials at each step, however, will not adversely affect the reaction.

When it is desired to produce more than one specific nucleic acid sequence from the first nucleic acid or mixture
of nucleic acids, the appropriate number of different oligonucleotide primers are utilized. For example, if two different
specific nucleic acid sequences are to be produced, four primers are utilized. Two of the primers are specific for one
of the specific nucleic acid sequences and the other two primers are specific for the second specific nucleic acid
sequence. In this manner, each of the two different specific sequences can be produced exponentially by the present
process.






After the appropriate length of time has passed to produce the desired amount of the specific nucleic acid sequence,
the reaction may be halted by inactivating the enzyme in any known manner (e.g., by adding EDTA, phenol, SDS or
CHCl3) or by separating the components of the reaction.

The amplification process may be conducted continuously. In one embodiment of an automated process, the
reaction mixture may be temperature cycled such that the temperature is programmed to be controlled at a certain level
for a certain time.

One such instrument for this purpose is an automated machine for handling the amplification reaction of this
invention. This instrument utilizes a liquid handling system under computer control to make liquid transfers of enzyme
stored at a controlled temperature in a first receptacle into a second receptacle whose temperature is controlled by
the computer to conform to a certain incubation profile. The second receptacle stores the nucleic acid sequence(s) to
be amplified plus the nucleotide triphosphates and primers. The computer includes a user interface through which a
user can enter process parameters that control the characteristics of the various steps in the amplification sequence,
such as the times and temperatures of incubation, the amount of enzyme to transfer, etc.

A preferred machine that may be employed utilizes temperature cycling without a liquid handling system because
the enzyme need not be transferred at every cycle. Such a machine consists of the following systems:

1. A heat-conducting container for holding a given number of tubes, preferably 500 µl tubes, which contain the
reaction mixture of nucleotide triphosphates, primers, nucleic acid sequences, and enzyme.

2. A means to heat, cool, and maintain the heat-conducting container above and below room temperature, which
means has an input for receiving a control signal for controlling which of the temperatures at or to which the
container is heated, cooled or maintained. (These may be Peltier heat pumps available from Materials Electronics
Products Corporation in Trenton, NJ or a water heat exchanger.)

3. A computer means (e.g., a microprocessor controller), coupled to the input of said means, to generate the signals
that control automatically the amplification sequence, the temperature levels, and the temperature ramping and
timing.
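
The control sequence generated by such a computer means can be sketched as a simple schedule of set points. The following Python fragment is a minimal illustration, not the controller of the machine described: the class and function names are hypothetical, the set points (94°C, 55°C, 72°C) and hold times are assumed values chosen from within the preferred ranges given above for Taq polymerase, and temperature ramping between steps is ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: float
    seconds: int


# Assumed set points, chosen from within the preferred ranges quoted in the text.
DEFAULT_CYCLE = (
    Step("denature", 94.0, 60),
    Step("anneal", 55.0, 120),
    Step("extend", 72.0, 120),
)

# Preferred temperature ranges (degrees C) stated in the text for Taq polymerase.
PREFERRED_RANGES_C = {
    "denature": (90.0, 100.0),
    "anneal": (45.0, 58.0),
    "extend": (65.0, 75.0),
}


def schedule(cycles, cycle=DEFAULT_CYCLE):
    """Yield (cycle_number, step) pairs; ramp times between steps are ignored."""
    for step in cycle:
        low, high = PREFERRED_RANGES_C[step.name]
        if not low <= step.temp_c <= high:
            raise ValueError(f"{step.name} at {step.temp_c} C is outside {low}-{high} C")
    for number in range(1, cycles + 1):
        for step in cycle:
            yield number, step


plan = list(schedule(25))
total_minutes = sum(step.seconds for _, step in plan) / 60
print(len(plan), "set points,", total_minutes, "minutes of hold time")
```

Because the enzyme survives the denaturation step, the schedule is a fixed repetition of the same three set points, which is why no liquid handling is needed between cycles.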

In another embodiment, the enzyme used for the synthesis of primer extension products can be immobilized in a 
column. The other reaction components can be continuously circulated by a pump through the column and a heating 
coil in series. Thus, the nucleic acids produced can be repeatedly denatured without inactivating the enzyme. 

The amplification protocol is demonstrated diagrammatically below, where double-stranded DNA containing the
desired sequence [S] comprised of complementary strands [S+] and [S-] is utilized as the nucleic acid. During the first
and each subsequent reaction cycle, extension of each oligonucleotide primer on the original template will produce
one new ssDNA molecule product of indefinite length that terminates with only one of the primers. These products,
hereafter referred to as "long products," will accumulate in a linear fashion; that is, the amount present after any number
of cycles will be proportional to the number of cycles.

The long products thus produced will act as templates for one or the other of the oligonucleotide primers during
subsequent cycles and will produce molecules of the desired sequence [S+] or [S-]. These molecules will also function
as templates for one or the other of the oligonucleotide primers, producing further [S+] and [S-], and thus a chain
reaction can be sustained that will result in the accumulation of [S] at an exponential rate relative to the number of cycles.

By-products formed by oligonucleotide hybridizations other than those intended are not self-catalytic (except in
rare instances) and thus accumulate at a linear rate.
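
The linear and exponential accumulation described above can be checked with a short bookkeeping sketch. It assumes one double-stranded starting molecule and 100% efficiency at every cycle; the function name and the dictionary keys are hypothetical. Counts are in single strands, so dividing by two gives duplex equivalents.

```python
def strand_counts(cycles: int) -> dict:
    """Count single strands of each kind after the given number of cycles.

    Assumes one starting duplex (two original strands) and that every
    strand present is copied exactly once per cycle.
    """
    original, long_products, specific = 2, 0, 0
    for _ in range(cycles):
        # Each original strand templates one new long product.
        new_long = original
        # Long products and [S] strands each template one new [S] strand.
        new_specific = long_products + specific
        long_products += new_long
        specific += new_specific
    return {"original": original, "long": long_products, "specific": specific}


for n in (1, 2, 3, 5, 10, 20):
    counts = strand_counts(n)
    print(n, counts["original"] // 2, counts["long"] // 2, counts["specific"] // 2)
```

Under these assumptions the original template stays at one duplex, the long products grow as n, and the specific sequence grows as 2^n - n - 1 duplex equivalents, since the total number of duplexes doubles each cycle.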

The specific sequence to be amplified, [S], can be depicted diagrammatically as

[S+] 5' AAAAAAAAAAXXXXXXXXXXCCCCCCCCCC 3'
[S-] 3' TTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5'

The appropriate oligonucleotide primers would be:

Primer 1: 3' GGGGGGGGGG 5'
Primer 2: 5' AAAAAAAAAA 3'

so that if DNA containing [S]

....zzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz....
....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz....

is separated into single strands and its single strands are hybridized to Primers 1 and 2, the following extension reactions
can be catalyzed by a thermostable polymerase in the presence of the four nucleotide triphosphates:

                        extends <------ 3' GGGGGGGGGG 5'  Primer 1
5' ....zzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
original template strand+

original template strand-
3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz.... 5'
          Primer 2  5' AAAAAAAAAA 3' ------> extends

On denaturation of the two duplexes formed, the products are

3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5'
newly synthesized long product 1

5' ....zzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
original template strand+

3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz.... 5'
original template strand-

                    5' AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
newly synthesized long product 2

If these four strands are allowed to rehybridize with Primers 1 and 2 in the next cycle, the thermostable polymerase
will catalyze the following reactions:

          Primer 2  5' AAAAAAAAAA ------> extends to here
3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5'
newly synthesized long product 1

                           extends <------ GGGGGGGGGG 5'  Primer 1
5' ....zzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
original template strand+

          Primer 2  5' AAAAAAAAAA ------> extends
3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz.... 5'
original template strand-

                   extends to here <------ GGGGGGGGGG 5'  Primer 1
                    5' AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
newly synthesized long product 2

If the strands of the above four duplexes are separated, the following strands are found:

                    5' AAAAAAAAAAXXXXXXXXXXCCCCCCCCCC 3'
newly synthesized [S+]

3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5'
first cycle synthesized long product 1

3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5'
newly synthesized long product 1

5' ....zzzzzzzzzzzzzzzzAAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
original template strand+

                    5' AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
newly synthesized long product 2

3' ....zzzzzzzzzzzzzzzzTTTTTTTTTTYYYYYYYYYYGGGGGGGGGGzzzzzzzzzzzzzzzz.... 5'
original template strand-

                    3' TTTTTTTTTTYYYYYYYYYYGGGGGGGGGG 5'
newly synthesized [S-]

                    5' AAAAAAAAAAXXXXXXXXXXCCCCCCCCCCzzzzzzzzzzzzzzzz.... 3'
first cycle synthesized long product 2

It is seen that each strand which terminates with the oligonucleotide sequence of one primer and the complementary
sequence of the other is the specific nucleic acid sequence [S] that is desired to be produced.

The amount of original nucleic acid remains constant in the entire process because it is not replicated. The amount
of the long products increases linearly because they are produced only from the original nucleic acid. The amount of
the specific sequence increases exponentially. Thus the specific sequence will become the predominant species. This
is illustrated in the following table, which indicates the relative amounts of the species theoretically present after n
cycles, assuming 100% efficiency at each cycle:

Number of Double Strands After 0 to n Cycles

When a single-stranded nucleic acid is utilized as the template, only one long product is formed per cycle.

A sequence within a given sequence can be amplified after a given number of amplifications to obtain greater
specificity of the reaction by adding, after at least one cycle of amplification, a set of primers that are complementary to
internal sequences (that are not on the ends) of the sequence to be amplified. Such primers may be added at any
stage and will provide a shorter amplified fragment. Alternatively, a longer fragment can be prepared by using primers
with non-complementary ends but having some overlap with the primers previously utilized in the amplification.

The method herein may be utilized to clone a particular nucleic acid sequence for insertion into a suitable expression 
vector. The vector may then be used to transform an appropriate host organism to produce the gene product of the 
sequence by standard methods of recombinant DNA technology. 

The amplification process herein may yield a mixture of nucleic acids, resulting from the original template nucleic
acid, the expected target amplified products, and various background non-target products. The amplified product can
also be a mixture if the original template DNA contains multiple target sequences, such as in a heterozygous diploid
genome or when there is a family of related genes.

The primers herein may be modified to assist the rapid and specific cloning of the mixture of DNAs produced by
the amplification reaction. In one such modification, a restriction site is contained in each of the primers or in the
sequence to be amplified and cloned. Preferably, the same or different restriction sites are incorporated at the 5' ends
of the primers to result in restriction sites at the two ends of the amplified product. When cut with the appropriate
enzymes, the amplified product can then be easily inserted into plasmid or viral vectors and cloned. This cloning allows
the analysis or expression of individual amplified products, not a mixture.

If the primers have restriction sites incorporated therein, the same restriction site can be used for both primers.
The use, however, of different sites allows the insertion of the product into the vector with a specific orientation and
suppresses multiple insertions as well as insertions arising from amplifications based on only one of the two primers.
The specific orientation is useful when cloning into single-strand sequencing vectors, when single-strand hybridization
probes are used, or when the cloned product is being expressed.

One method to prepare the primers is to choose a primer sequence which differs minimally from the target
sequence. Regions in which each of the primers is to be located are screened for homology to restriction sites appropriate
to the desired vector. For example, the target sequence "CAGTATCCGA..." differs by only one base from one containing
a BamHI site. A primer sequence is chosen to match the target exactly at its 3' end, and to contain the altered sequence
and restriction site near its 5' end (for example, "CAGgATCCGA...", where the lowercase letter symbolizes a mismatch
with the target sequence). This minimally altered sequence will not interfere with the ability of the primer to hybridize
to the original target sequence and to initiate polymerization. After the first amplification cycle the primer is copied,
becomes the target, and matches exactly with new primers.
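
The screening step described above, in which a primer region is searched for near matches to a restriction site, can be sketched as follows. The function name is hypothetical; the target fragment and the BamHI recognition sequence GGATCC are those of the example in the text, and the altered base is reported in lowercase following the same convention.

```python
BAMHI = "GGATCC"  # BamHI recognition sequence


def one_mismatch_sites(target: str, site: str) -> list:
    """Find windows of target that differ from site by exactly one base.

    Returns (offset, altered_sequence) pairs, where altered_sequence is the
    target with the single changed base written in lowercase.
    """
    hits = []
    for i in range(len(target) - len(site) + 1):
        window = target[i:i + len(site)]
        diffs = [j for j, (a, b) in enumerate(zip(window, site)) if a != b]
        if len(diffs) == 1:
            j = diffs[0]
            altered = target[:i + j] + site[j].lower() + target[i + j + 1:]
            hits.append((i, altered))
    return hits


print(one_mismatch_sites("CAGTATCCGA", BAMHI))
```

For the example fragment this reports a single candidate, the T to G change at the fourth base, which places the mismatch away from the 3' end of the primer as the text requires.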

After the amplification process, the products are cleaved with the appropriate restriction enzymes, the restriction
digest is optionally separated from inhibitors of ligation such as the nucleotide triphosphates and salts by, for example,
passing over a desalting column, or a molecular weight chromatography column or through a membrane, and the
digestion product(s) containing the amplified sequence to be cloned is/are inserted by ligation into a cloning vector
such as bacteriophage M13. The cloning vector generally has a selectable marker and may optionally also have a
promoter. The gene may then be sequenced and/or expressed, if it codes for a protein, using well known techniques.
The gene may also be sequenced by adding an appropriate primer during the amplification process, which primer is
complementary to the desired portion which is to be sequenced. The primer will form an extension product, and the
extent of amplification with such extension product will provide sequence information.

Another method for preparing the primers involves taking the 3' end of the primers from the target sequence and
adding the desired restriction site(s) to the 5' end of the primer. For the above example, a HindIII site could be added
to make the sequence "cgaagcttCAGTATCCGA...", where lowercase letters are as described above. The added bases
would not contribute to the hybridization in the first cycle of amplification, but would match in subsequent cycles. The
final amplified products are then cut with restriction enzyme(s) and cloned and expressed as described above. The
gene being amplified may be, for example, human beta-hemoglobin or the human HLA DQ, DR or DP-α and -β genes.
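
The construction of such a tailed primer can be written out explicitly. In this minimal sketch the function names are hypothetical; the HindIII recognition sequence AAGCTT, the two extra 5' bases "cg", and the target fragment are taken from the example in the text. Lowercase marks bases that do not pair with the original target.

```python
HINDIII = "AAGCTT"  # HindIII recognition sequence


def tailed_primer(target_match: str, site: str, extra_5prime: str = "cg") -> str:
    """Prefix a target-matching 3' portion with a non-matching 5' tail."""
    return (extra_5prime + site).lower() + target_match.upper()


def bases_pairing_in_first_cycle(primer: str) -> int:
    """Count the uppercase bases, i.e. those that match the original target."""
    return sum(1 for base in primer if base.isupper())


primer = tailed_primer("CAGTATCCGA", HINDIII)
print(primer)
print(bases_pairing_in_first_cycle(primer), "of", len(primer), "bases pair in the first cycle")
```

Once the primer has been copied into an extension product, the whole primer, tail included, matches its new template, which is why the added bases contribute to hybridization only from the second cycle onward.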

In an alternative, but less preferred and less efficient, method of cloning wherein blunt-end ligation is employed
rather than sticky-end ligation (using restriction enzymes), the basic amplification procedure is employed without
concern for restriction sites in the primers or sequence(s) to be cloned. The steps must be repeated sufficiently, however,
to produce enough amplified sequence(s) to effect ligation. Blunt-end ligation requires greater concentrations of
sequence(s) and cloning vector(s) to be present than sticky-end ligation. In addition, the ligation must take place in the
presence of a ligase, such as T4 ligase or E. coli ligase. Once the amplified product is obtained, the ligation
procedure is a standard procedure using conditions well known to those skilled in the art.

The cloning method which does not involve blunt end ligation controls the orientation or multiplicity of insertion of 
the amplified product into the cloning vector. 

In addition, the process herein can be used for in vitro mutagenesis. The oligonucleotide primers need not be
exactly complementary to the nucleic acid sequence which is being amplified. It is only necessary that they be able to
hybridize to the sequence sufficiently well to be extended by the thermostable enzyme. The product of an amplification
reaction wherein the primers employed are not exactly complementary to the original template will contain the sequence
of the primer rather than the template, thereby introducing an in vitro mutation. In further cycles this mutation will be
amplified with an undiminished efficiency because no further mispaired primings are required. The mutant thus
produced may be inserted into an appropriate vector by standard molecular biological techniques and might confer mutant
properties on this vector such as the potential for production of an altered protein.

The process of making an altered DNA sequence as described above could be repeated on the altered DNA using
different primers to induce further sequence changes. In this way a series of mutated sequences could gradually be
produced wherein each new addition to the series could differ from the last in a minor way, but from the original DNA
source sequence in an increasingly major way. In this manner changes could be made ultimately which were not
feasible in a single step due to the inability of a very seriously mismatched primer to function.

In addition, the primer can contain as part of its sequence a non-complementary sequence, provided that a sufficient
amount of the primer contains a sequence which is complementary to the strand to be amplified. For example, a
nucleotide sequence which is not complementary to the template sequence (such as, e.g., a promoter, linker, coding
sequence, etc.) may be attached at the 5' end of one or both of the primers, and thereby appended to the product of
the amplification process. After the extension primer is added, sufficient cycles are run to achieve the desired amount
of new template containing the non-complementary nucleotide insert. This allows production of large quantities of the
combined fragments in a relatively short period of time (e.g., two hours or less) using a simple technique.

The method herein may also be used to enable detection and/or characterization of specific nucleic acid sequences
associated with infectious diseases, genetic disorders or cellular disorders such as cancer, e.g., oncogenes.
Amplification is useful when the amount of nucleic acid available for analysis is very small, as, for example, in the prenatal
diagnosis of sickle cell anemia using DNA obtained from fetal cells. Amplification is particularly useful if such an analysis
is to be done on a small sample using non-radioactive detection techniques which may be inherently insensitive, or
where radioactive techniques are being employed but where rapid detection is desirable.

For purposes of this invention genetic diseases may include specific deletions and/or mutations in genomic DNA
from any organism, such as, e.g., sickle cell anemia, α-thalassemia, β-thalassemia, and the like. Sickle cell anemia
can be readily detected via oligomer restriction analysis as described in EP Patent Publication 164,054 published
December 11, 1985, or via a RFLP-like analysis following amplification of the appropriate DNA sequence by the present
method. α-Thalassemia can be detected by the absence of a sequence, and β-thalassemia can be detected by the
presence of a polymorphic restriction site closely linked to a mutation that causes the disease.

All of these genetic diseases may be detected by amplifying the appropriate sequence and analyzing it by Southern
blots without using radioactive probes. In such a process, for example, a small sample of DNA from, e.g., amniotic
fluid containing a very low level of the desired sequence is amplified, cut with a restriction enzyme, and analyzed via
a Southern blotting technique. The use of non-radioactive probes is facilitated by the high level of the amplified signal.

In another embodiment, a small sample of DNA may be amplified to a convenient level and then a further cycle of
extension reactions performed wherein nucleotide derivatives which are readily detectable (such as 32P-labeled or
biotin-labeled nucleotide triphosphates) are incorporated directly into the final DNA product, which may be analyzed
by restriction and electrophoretic separation or any other appropriate method.

In a further embodiment, the nucleic acid may be exposed to a particular restriction endonuclease prior to
amplification. Since a sequence which has been cut cannot be amplified, the appearance of an amplified fragment, despite
prior restriction of the DNA sample, implies the absence of a site for the endonuclease within the amplified sequence.
The presence or absence of an amplified sequence can be detected by an appropriate method.

A practical application of this technique can be illustrated by its use in facilitating the detection of sickle cell anemia
via the oligomer restriction technique described herein and in EP 164,054, supra, and Saiki et al., Bio/Technology, 3,
pp. 1008-1012 (1985). Sickle cell anemia is a hemoglobin disease which is caused by a single base pair change in the
sixth codon of the β-globin gene.

The method of this invention may also be used to detect directly single-base pair variations in nucleic acid sequence
(such as genomic DNA) using sequence-specific oligonucleotides. In this method, the sequence variation, whether
resulting from cancer, an infectious disease, or a genetic disease, e.g., a genetic lesion, is directly detected, eliminating
the need for restriction digestion, electrophoresis, and gel manipulations otherwise required. The use of
sequence-specific oligonucleotides in a dot blot format after amplification, as described herein, provides for improved specificity
and sensitivity of the probe; an interpretable signal can be obtained with a 0.04 µg sample in six hours. Also, if the
amount of sample spotted on a membrane is increased to 0.1-0.5 µg, non-isotopically labeled oligonucleotides may
be utilized rather than the radioactive probes used in previous methods. Furthermore, the process described
hereinbelow is applicable to use of sequence-specific oligonucleotides less than 19-mers in size, thus allowing use of more
discriminatory sequence-specific oligonucleotides.

Regarding genetic diseases, while RFLP requires a polymorphic restriction site to be associated with the disease,
sequence-specific oligonucleotides directly detect the genetic lesion and are generally more useful for the analysis of
such diseases as hemoglobin C disease, α-1-antitrypsin and β-thalassemia, which result from single-base mutations.
In addition, the oligonucleotides can be used to distinguish between genetic variants which represent different alleles
(e.g., HLA typing), indicating the feasibility of a sequence-specific oligonucleotide-based HLA typing kit that includes
a thermostable enzyme.

In one embodiment of the invention herein wherein a nucleotide variation in sequence is to be detected, the sample,
amplified as described above using one primer for each strand of each nucleic acid suspected of containing the
nucleotide variation, is spotted directly on a series of membranes and each membrane is hybridized with a different
labeled sequence-specific oligonucleotide probe. One procedure for spotting the sample on a membrane is described
by Kafatos et al., Nucleic Acids Research, 7:1541-1552 (1979).

Briefly, the DNA sample affixed to the membrane may be pretreated with a prehybridization solution containing
sodium dodecyl sulfate, Ficoll, serum albumin and various salts prior to the probe being added. Then, a labeled
oligonucleotide probe which is specific to each sequence variation to be detected is added to a hybridization solution similar
to the prehybridization solution. The hybridization solution is applied to the membrane and the membrane is subjected
to hybridization conditions that will depend on the probe type and length, the type and concentration of ingredients,
etc. Generally, hybridization is carried out at about 25 to 75°C, preferably 35 to 65°C, for 0.25-50 hours, preferably
less than three hours. The greater the stringency of conditions, the greater the required complementarity for
hybridization between the probe and sample. If the background level is high, stringency may be increased accordingly. The
stringent conditions can also be incorporated in the wash.

After the hybridization, the sample is washed of unhybridized probe using any suitable means such as by washing
one or more times with varying concentrations of standard saline phosphate EDTA (SSPE) (180 mM NaCl, 10 mM
NaHPO4 and 1 mM EDTA, pH 7.4) solutions at 25-75°C for about 10 minutes to one hour, depending on the temperature.
The label is then detected by using any appropriate detection technique.

The sequence-specific oligonucleotide employed herein is an oligonucleotide which is generally prepared and
selected as described above for preparing and selecting the primers. As described above, the sequence-specific
oligonucleotide must encompass the region of the sequence which spans the nucleotide variation being detected and
must be specific for the nucleotide variation being detected. For example, if it is desired to detect whether a sample
contains the mutation for sickle cell anemia, one oligonucleotide will be prepared which contains the nucleotide
sequence site characteristic of the normal β-globin gene and one oligonucleotide will be prepared which contains the
nucleotide sequence characteristic of the sickle cell allele. Each oligonucleotide would be hybridized to duplicates of
the same sample to determine whether the sample contains the mutation.

The polymorphic areas of HLA class II genes are localized to specific regions of the first exon and are flanked by
conserved sequences, so that oligonucleotide primers complementary to opposite strands of the conserved 5' and 3'
ends of the first exon can be prepared.

The number of oligonucleotides employed for detection of the polymorphic areas of the HLA class II genes will
vary depending on the type of gene, which has regions of base pair variation which may be clustered or spread apart.
If the regions are clustered, as in the case with HLA-DQ-α, then one oligonucleotide is employed for each allele. If the
regions are spread apart, as in the case with HLA-DQ-β and HLA-DR-β, then more than one probe, each encompassing
an allelic variant, will be used for each allele. In the case of HLA-DQ-β and HLA-DR-β, three probes are employed for
the three regions of the locus where allelic variations may occur. For detection of insulin-dependent diabetes mellitus
(IDDM), four probes for the HLA-DRβ second exon are employed.

Haplotypes can be inferred from segregation analysis in families or, in some cases, by direct analysis of the
individual DNA sample. Specific allelic combinations (haplotypes) of sequence-specific oligonucleotide reactivities can be
identified in heterozygous cells by using restriction enzyme digestion of the genomic DNA prior to amplification.
For example, if in DQβ one finds three highly variable subregions A, B, and C within a single amplified region and
if there are six different sequences at each region (A1-6, B1-6, C1-6), then an individual could be typed in the DQβ
locus by sequence-specific oligonucleotide probe analysis as containing A1, A2, B2, B3, C1, C4, with the possible
haplotype combinations of A1, B2, C1; A1, B2, C4; A2, B2, C1; A2, B2, C4; A1, B3, C1; A1, B3, C4; A2, B3, C1; and
A2, B3, C4.
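
The candidate haplotypes in this example are simply every way of taking one detected allele from each subregion. The following Python sketch enumerates them for the reactivities named in the text; the function and variable names are illustrative assumptions and are not part of the original disclosure.

```python
from itertools import product

# Probe reactivities observed for one individual, grouped by subregion
# (values taken from the worked example in the text).
observed = {"A": ["A1", "A2"], "B": ["B2", "B3"], "C": ["C1", "C4"]}


def candidate_haplotypes(reactivities):
    """Return every combination that takes one allele from each subregion."""
    subregions = sorted(reactivities)
    return [tuple(combo) for combo in product(*(reactivities[s] for s in subregions))]


haplotypes = candidate_haplotypes(observed)
for haplotype in haplotypes:
    print(", ".join(haplotype))
```

With two alleles detected at each of three subregions, the sketch yields 2 x 2 x 2 = 8 distinct candidates, which is why probe reactivity alone cannot assign the two haplotypes actually present.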

If the genomic DNA is digested with a polymorphic restriction enzyme prior to amplification, and if the enzyme cuts
both alleles between the primers, there is no reactivity with the sequence-specific probes due to lack of amplification,
and the result is uninformative. If the enzyme cuts neither allele, the probe results with digested and undigested genomic
DNA are the same and the result is uninformative. If the enzyme cuts only one allele, however, then one can infer both
haplotypes by comparing the probe reactivity patterns on digested and undigested DNA.
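
The three outcomes described above can be written as a small decision procedure. The sketch below is only an illustration of that reasoning under simplifying assumptions (a diploid sample, error-free probe calls, and reactivities supplied as sets per subregion); the function name and data layout are hypothetical.

```python
def infer_haplotypes(undigested, digested):
    """Interpret probe reactivities before and after restriction digestion.

    Both arguments map a subregion name to the set of alleles detected.
    Returns None when the comparison is uninformative, otherwise a pair
    (uncut_haplotype, cut_haplotype) of dicts keyed by subregion.
    """
    if not any(digested.values()):
        return None  # enzyme cut both alleles: nothing was amplified
    if digested == undigested:
        return None  # enzyme cut neither allele: patterns are identical
    uncut, cut = {}, {}
    for region, alleles in undigested.items():
        surviving = digested.get(region, set())
        if len(surviving) != 1:
            return None  # pattern does not fit the one-allele-cut case
        (kept,) = surviving
        uncut[region] = kept
        lost = alleles - surviving
        # A homozygous subregion shows the same allele on both chromosomes.
        cut[region] = next(iter(lost)) if lost else kept
    return uncut, cut


undigested = {"A": {"A1", "A2"}, "B": {"B2", "B3"}, "C": {"C1", "C4"}}
digested = {"A": {"A1"}, "B": {"B3"}, "C": {"C4"}}
print(infer_haplotypes(undigested, digested))
```

The digested pattern directly gives the haplotype that escaped cleavage, and the remaining reactivities in the undigested pattern give the other.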

The haplotypes can be deduced by comparing sequence-specific oligonucleotide reactivities with uncut genomic
DNA and genomic DNA cut with one or several enzymes known to be polymorphic and to recognize sites between the
primers.

The length of the sequence-specific oligonucleotide will depend on many factors, including the particular target
molecule being detected, the source of oligonucleotide, and the nucleotide composition. For purposes herein, the
sequence-specific oligonucleotide typically contains 15-25 nucleotides, although it may contain more or fewer
nucleotides. While oligonucleotides which are at least 19 mers in length may enhance specificity and/or sensitivity, probes
which are less than 19 mers, e.g., 16-mers, may show more sequence-specific discrimination, presumably because a
single mismatch is more destabilizing. Because amplification increases specificity so that a longer length is less critical,
and hybridization and washing temperatures can be lower for the same salt concentration, it is preferred to use
oligonucleotides which are less than 19 mers in length.

Where the sample is first placed on the membrane and then detected with the oligonucleotide, the oligonucleotide
must be labeled with a suitable label moiety, which may be detected by spectroscopic, photochemical, biochemical,
immunochemical or chemical means. Immunochemical means include antibodies which are capable of forming a
complex with the oligonucleotide under suitable conditions, and biochemical means include polypeptides or lectins capable
of forming a complex with the oligonucleotide under the appropriate conditions. Examples include fluorescent dyes,
electron-dense reagents, enzymes capable of depositing insoluble reaction products or being detected
chromogenically, such as horseradish peroxidase, alkaline phosphatase, a radioactive label such as 32P, or biotin. If biotin is
employed, a spacer arm may be utilized to attach it to the oligonucleotide. Preferably, the label moiety is horseradish
peroxidase.

Alternatively, in one "reverse" dot blot format, at least one of the primers and/or at least one of the four nucleotide
triphosphates is labeled with a detectable label so that the resulting amplified sequence is labeled. These labeled
moieties may be present initially in the reaction mixture or added during a later cycle of the amplification to introduce
the label to the amplification product. Then an unlabeled sequence-specific oligonucleotide capable of hybridizing with
the amplified nucleic acid sequence, if the sequence variation(s) (whether normal or mutant) is/are present, is spotted
on (affixed to) the membrane under prehybridization conditions as described above. The amplified sample is then
added to the pretreated membrane under hybridization conditions as described above. Finally, detection means are
used to determine if an amplified sequence in the nucleic acid sample has hybridized to the oligonucleotide affixed to
the membrane. Hybridization will occur only if the membrane-bound sequence containing the variation is present in
the amplification product, i.e., only if a sequence of the probe is complementary to a region of the amplified sequence.

In another version of the "reverse" dot blot format, the amplification is carried out without employing a label as with
the "forward" dot blot format described above, and a labeled sequence-specific oligonucleotide probe capable of
hybridizing with the amplified nucleic acid sequence containing the variation, if present, is spotted on (affixed to) the
membrane under prehybridization conditions as described above. The amplified sample is then added to the pretreated
membrane under hybridization conditions as described above. Then the labeled oligonucleotide or a fragment thereof
is released from the membrane in such a way that a detection means can be used to determine if an amplified sequence
in the sample hybridized to the labeled oligonucleotide. The release may take place, for example, by adding a restriction
enzyme to the membrane which recognizes a restriction site in the probe. This procedure, known as oligomer restriction,
is described more fully in EP Patent Publication 164,054 published December 11, 1985.

In both the forward and reverse dot blot methods, the genetic diseases which may be detected include specific
deletions, insertions and/or substitutions in any base pair mutation or polymorphism in nucleic acids, for example,
genomic DNA, from any organism. Examples of diseases in which base pair variation is known include sickle cell
anemia, hemoglobin C disease, α-thalassemia, β-thalassemia and the like. Other diseases that may be detected
include cancerous diseases such as those involving the RAS oncogenes, e.g., the n-RAS oncogene, and infectious
diseases.

A dot blot process may also be used for HLA typing in the areas of tissue transplantation, disease susceptibility
and paternity determination. The HLA class II genes, consisting of the α and β genes from the HLA-DR, HLA-DQ and
HLA-DP regions, are highly polymorphic; their genetic complexity at the DNA level is significantly greater than the
polymorphism currently defined by serological typing. In addition, the process herein may be employed to detect four
DNA sequences coding for HLA class II β proteins (e.g., DRβ) associated with insulin-dependent diabetes mellitus
(IDDM). Briefly, the four DNA sequences associated with IDDM are selected from the group consisting of

1) 5'-GAGCTGCGTAAGTCTGAG-3',

2) 5'-GAGGAGTTCCTGCGCTTC-3',

3) 5'-CCTGTCGCCGAGTCCTGG-3', and

4) 5'-GACATCCTGGAAGACGAGAGA-3',

or the DNA strands that are complementary thereto. Sequence-specific probes may be prepared that will hybridize to 
one or more of these sequences. 
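
The complementary strands referred to above can be derived mechanically from the four listed sequences. The following Python sketch computes the reverse complement of each, written 5' to 3'; the helper name and table are illustrative assumptions rather than part of the disclosure.

```python
# The four IDDM-associated sequences as listed in the text (5' to 3').
IDDM_SEQUENCES = [
    "GAGCTGCGTAAGTCTGAG",
    "GAGGAGTTCCTGCGCTTC",
    "CCTGTCGCCGAGTCCTGG",
    "GACATCCTGGAAGACGAGAGA",
]

# Watson-Crick pairing for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


for seq in IDDM_SEQUENCES:
    print("5'-%s-3'  ->  5'-%s-3'" % (seq, reverse_complement(seq)))
```

A probe designed against either strand of a given sequence would be expected to hybridize to the other strand of the amplified product.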

Various infectious diseases can be diagnosed by the presence in clinical samples of specific DNA sequences
characteristic of the causative microorganism. These include bacteria, such as Salmonella, Chlamydia and Neisseria;
viruses, such as the hepatitis viruses; and parasites, such as the Plasmodium responsible for malaria. U.S. Patent
Reexamination Certificate B1 4,358,535 issued on May 13, 1986 to Falkow et al. describes the use of specific DNA
hybridization probes for the diagnosis of infectious diseases. A relatively small number of pathogenic organisms may
be present in a clinical sample from an infected patient and the DNA extracted from these may constitute only a very
small fraction of the total DNA in the sample. Specific amplification of suspected sequences prior to immobilization
and hybridization detection of the DNA samples could greatly improve the sensitivity and specificity of traditional
procedures.

Routine clinical use of DNA probes for the diagnosis of infectious diseases would be simplified considerably if
non-radioactively labeled probes could be employed as described in EP 63,879 to Ward. In this procedure biotin-containing
DNA probes are detected by chromogenic enzymes linked to avidin or biotin-specific antibodies. This type of detection
is convenient, but relatively insensitive. The combination of specific DNA amplification by the present method and the
use of stably labeled probes could provide the convenience and sensitivity required to make the Falkow and Ward
procedures useful in a routine clinical setting.

A specific use of the amplification technology for detecting or monitoring for the AIDS virus is described as follows.
The amplification and detection process is used with primers and probes which are designed to amplify and detect,
respectively, nucleic acid sequences which are substantially conserved among the nucleic acids in AIDS viruses and
specific to the nucleic acids in AIDS viruses. Thus, the sequence to be detected must be sufficiently complementary
to the nucleic acids in AIDS viruses to initiate polymerization, preferably at room temperature, in the presence of the
enzyme and nucleotide triphosphates.

In addition, the probe may be a biotinylated probe in which the biotin is attached to a spacer arm of the formula:

-Y-(CH2)2-O-[(CH2)xO]y-CH2CH2-NH-

where Y is O, NH or N-CHO, x is a number from 1 to 4, and y is a number from 2 to 4. The spacer arm is in turn attached
to a psoralen moiety of the formula:

[Structural formula of the psoralen moiety not reproduced.]

The psoralen moiety intercalates into and crosslinks a "gapped circle" probe as described by Courage-Tebbe et al.,
Biochim. Biophys. Acta, 697 (1982) 1-5, wherein the single-stranded hybridization region of the gapped circle spans
the region contained in the primers. The details of this biotinylation and dot blot procedure are described more fully in
commonly assigned U.S. Patent No. 4,582,789 issued April 15, 1986 and U.S. Patent No. 4,617,261 issued October
14, 1986.

The amplification process can also be utilized to produce sufficient quantities of DNA from a single copy human
gene such that detection by a simple non-specific DNA stain such as ethidium bromide can be employed to diagnose
DNA directly.

In addition to detecting infectious diseases and pathological abnormalities in the genome of organisms, the
amplification process can also be used to detect DNA polymorphisms which may not be associated with any pathological
state.

In summary, the amplification process is seen to provide a process for amplifying one or more specific nucleic acid
sequences using a chain reaction and a thermostable enzyme, in which reaction primer extension products are
produced which can subsequently act as templates for further primer extension reactions. The process is especially useful
in detecting nucleic acid sequences which are initially present in only very small amounts.

The following examples are offered by way of illustration only and are by no means intended to limit the scope of
the claimed invention. In these examples, all percentages are by weight if for solids and by volume if for liquids, unless
otherwise noted, and all temperatures are given in degrees Celsius.

EXAMPLE I 

I. Synthesis of the Primers

The following two oligonucleotide primers were prepared by the method described below:

5'-ACACAACTGTGTTCACTAGC-3' (PC03)

5'-CAACTTCATCCACGTTCACC-3' (PC04)

These primers, both 20-mers, anneal to opposite strands of the genomic DNA with their 5' ends separated by a distance
of 110 base pairs.
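
As a quick consistency check on the transcribed primer sequences, the following Python sketch confirms that each is a 20-mer and reports its G+C fraction. The G+C calculation is an added illustration, not a criterion stated in the text, and the names used are assumptions.

```python
# Primer sequences as given in the text (5' to 3').
PRIMERS = {
    "PC03": "ACACAACTGTGTTCACTAGC",
    "PC04": "CAACTTCATCCACGTTCACC",
}


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, seq in PRIMERS.items():
    print(name, len(seq), "nt", "GC = %.2f" % gc_fraction(seq))
```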

A. Automated Synthesis Procedures: The diethylphosphoramidites, synthesized according to Beaucage and
Caruthers (Tetrahedron Letters (1981) 22:1859-1862), were sequentially condensed to a nucleoside derivatized
controlled pore glass support. The procedure included detritylation with trichloroacetic acid in dichloromethane,
condensation using benzotriazole as activating proton donor, and capping with acetic anhydride and
dimethylaminopyridine in tetrahydrofuran and pyridine. Cycle time was approximately 30 minutes. Yields at each step were
essentially quantitative and were determined by collection and spectroscopic examination of the dimethoxytrityl
alcohol released during detritylation.

B. Oligodeoxyribonucleotide Deprotection and Purification Procedures: The solid support was removed from the
column and exposed to 1 ml concentrated ammonium hydroxide at room temperature for four hours in a closed
tube. The support was then removed by filtration and the solution containing the partially protected
oligodeoxynucleotide was brought to 55°C for five hours. Ammonia was removed and the residue was applied to a preparative
polyacrylamide gel. Electrophoresis was carried out at 30 volts/cm for 90 minutes, after which the band containing
the product was identified by UV shadowing of a fluorescent plate. The band was excised and eluted with 1 ml
distilled water overnight at 4°C. This solution was applied to a RP-HPLC column and eluted with a 7-13% gradient
of acetonitrile in 1% ammonium acetate buffer at pH 6.0. The elution was monitored by UV absorbance at 260 nm
and the appropriate fraction collected, quantitated by UV absorbance in a fixed volume and evaporated to dryness
at room temperature in a vacuum centrifuge.

C. Characterization of Oligodeoxyribonucleotides: Test aliquots of the purified oligonucleotides were 32P labeled
with polynucleotide kinase and γ-32P-ATP. The labeled compounds were examined by autoradiography of 14-20%
polyacrylamide gels after electrophoresis for 45 minutes at 50 volts/cm. This procedure verifies the molecular
weight. Base composition was determined by digestion of the oligodeoxyribonucleotide to nucleosides by use of
venom diesterase and bacterial alkaline phosphatase and subsequent separation and quantitation of the derived
nucleosides using a reverse phase HPLC column and a 10% acetonitrile, 1% ammonium acetate mobile phase.

II. Isolation of Human Genomic DNA from Cell Line 

High molecular weight genomic DNA was isolated from a T cell line, Molt 4, homozygous for normal β-globin,
available from the Human Genetic Mutant Cell Depository, Camden, NJ as GM2219C, using essentially the method of
Maniatis et al., supra, p. 280-281.

III. Purification of a Polymerase From Thermus aquaticus


Thermus aquaticus strain YT1, available without restriction from the American Type Culture Collection, 12301
Parklawn Drive, Rockville, MD, as ATCC No. 25,104 was grown in flasks in the following medium:

Sodium Citrate                   1 mM
Potassium Phosphate, pH 7.9      5 mM
Ammonium Chloride                10 mM
Magnesium Sulfate                0.2 mM
Calcium Chloride                 0.1 mM
Sodium Chloride                  1 g/l
Yeast Extract                    1 g/l
Tryptone                         1 g/l
Glucose                          2 g/l
Ferrous Sulfate                  0.01 mM

(The pH was adjusted to 8.0 prior to autoclaving.)
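
For preparing a batch of this medium, the listed concentrations scale linearly with volume. The Python sketch below tabulates the amount of each component for a chosen batch volume (10 liters is used only as an example); molar components are reported in millimoles rather than grams because the text does not specify the salt forms, and the names used are illustrative assumptions.

```python
# (component, concentration, unit) as listed in the medium table.
MEDIUM = [
    ("Sodium Citrate", 1.0, "mM"),
    ("Potassium Phosphate, pH 7.9", 5.0, "mM"),
    ("Ammonium Chloride", 10.0, "mM"),
    ("Magnesium Sulfate", 0.2, "mM"),
    ("Calcium Chloride", 0.1, "mM"),
    ("Sodium Chloride", 1.0, "g/l"),
    ("Yeast Extract", 1.0, "g/l"),
    ("Tryptone", 1.0, "g/l"),
    ("Glucose", 2.0, "g/l"),
    ("Ferrous Sulfate", 0.01, "mM"),
]


def amounts_for_volume(liters):
    """Return {component: (amount, unit)} for a batch of the given volume."""
    amounts = {}
    for name, conc, unit in MEDIUM:
        # mM x liters gives millimoles; g/l x liters gives grams.
        amounts[name] = (conc * liters, "mmol" if unit == "mM" else "g")
    return amounts


for name, (amount, unit) in amounts_for_volume(10).items():
    print("%-30s %8.2f %s" % (name, amount, unit))
```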

A 10-liter fermentor was inoculated from a seed flask cultured overnight in the above medium at 70°C. A total of
600 ml from the seed flask was used to inoculate 10 liters of the same medium. The pH was controlled at 8.0 with
ammonium hydroxide with the dissolved oxygen at 40%, with the temperature at 70°C, and with the stirring rate at 400
rpm.

After growth of the cells, the polymerase was purified using the protocol (with slight modification) of Kaledin et al., supra,
through the first five stages and using a different protocol for the sixth stage. All six steps were conducted at 4°C. The
rate of fractionation on columns was 0.5 columns/hour and the volumes of gradients during elution were 10 column
volumes. An alternative and preferred purification protocol is provided in Example VI below.

Briefly, the above culture of the T. aquaticus cells was harvested by centrifugation after nine hours of cultivation,
in late log phase, at a cell density of 1.4 g dry weight/l. Twenty grams of cells were resuspended in 80 ml of a buffer
consisting of 50 mM Tris-HCl pH 7.5, 0.1 mM EDTA. Cells were lysed and the lysate was centrifuged for two hours at
35,000 rpm in a rotor at 4°C. The supernatant was collected (fraction A) and the protein fraction precipitating between
45 and 75% saturation of ammonium sulfate was collected, dissolved in a buffer consisting of 0.2 M potassium
phosphate buffer, pH 6.5, 10 mM 2-mercaptoethanol, and 5% glycerine, and finally dialyzed against the same buffer to yield
fraction B.

Fraction B was applied to a 2.2 x 30-cm column of DEAE-cellulose, equilibrated with the above described buffer.
The column was then washed with the same buffer and the fractions containing protein (determined by absorbance at
280 nm) were collected. The combined protein fraction was dialyzed against a second buffer, containing 0.01 M
potassium phosphate buffer, pH 7.5, 10 mM 2-mercaptoethanol, and 5% glycerine, to yield fraction C.

Fraction C was applied to a 2.6 x 21-cm column of hydroxyapatite, equilibrated with the second buffer. The column
was then washed and the enzyme was eluted with a linear gradient of 0.01-0.5 M potassium phosphate buffer, pH 7.5,
containing 10 mM 2-mercaptoethanol and 5% glycerine. Fractions containing DNA polymerase activity (90-160 mM
potassium phosphate) were combined, concentrated four-fold using an Amicon stirred cell and YM10 membrane, and
dialyzed against the second buffer to yield fraction D.
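
To relate the reported elution window to position in the gradient, the following Python sketch converts a potassium phosphate concentration into the number of column volumes delivered. It assumes an ideal linear 0.01-0.5 M gradient spread over the 10 column volumes stated earlier and ignores any delay or mixing volume; the function name and defaults are illustrative assumptions.

```python
def gradient_position(conc_molar, start=0.01, end=0.5, gradient_cv=10.0):
    """Column volumes into an ideal linear gradient at which conc_molar is reached."""
    if not start <= conc_molar <= end:
        raise ValueError("concentration lies outside the gradient range")
    return (conc_molar - start) / (end - start) * gradient_cv


# Elution window reported for polymerase activity on hydroxyapatite.
for conc in (0.090, 0.160):
    print("%.0f mM reached after about %.2f column volumes" % (conc * 1000, gradient_position(conc)))
```

Under these assumptions the activity peak would span roughly 1.6 to 3.1 column volumes of the gradient.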

Fraction D was applied to a 1.6 x 23-cm column of DEAE-cellulose, equilibrated with the second buffer. The column
was washed and the polymerase was eluted with a linear gradient of 0.01-0.5 M potassium phosphate in the second
buffer. The fractions were assayed for contaminating endonuclease(s) and exonuclease(s) by electrophoretically
detecting the change in molecular weight of phage λ DNA or supercoiled plasmid DNA after incubation with an excess
of DNA polymerase (for endonuclease) and after treatment with a restriction enzyme that cleaves the DNA into several
fragments (for exonuclease). Only those DNA polymerase fractions (65-95 mM potassium phosphate) having minimal
nuclease contamination were pooled. To the pool was added autoclaved gelatin in an amount of 250 µg/ml, and dialysis
was conducted against the second buffer to yield Fraction E.

Fraction E was applied to a phosphocellulose column and eluted with a 100 ml gradient (0.01-0.4 M KCl gradient
in 20 mM potassium phosphate buffer pH 7.5). The fractions were assayed for contaminating endo/exonuclease(s) as
described above as well as for polymerase activity (by the method of Kaledin et al.) and then pooled. The pooled
fractions were dialyzed against the second buffer, then concentrated by dialysis against 50% glycerine and the second
buffer.

The molecular weight of the polymerase was determined by SDS PAGE. Marker proteins were phosphorylase B
(92,500), bovine serum albumin (66,200), ovalbumin (45,000), carbonic anhydrase (31,000), soybean trypsin inhibitor
(21,500) and lysozyme (14,400).
Preliminary data suggest that the polymerase has a molecular weight of about 86,000-90,000 daltons, not the
62,000-63,000 daltons reported in the literature (e.g., by Kaledin et al.).

The polymerase was incubated in 50 µl of a mixture containing 25 mM Tris-HCl pH 6.4 and pH 8.0, 0.1 M KCl, 10
mM MgCl2, 1 mM 2-mercaptoethanol, 10 nmoles each of dGTP, dATP, and TTP, and 0.5 µCi (3H) dCTP, 8 µg "activated"
calf thymus DNA, and 0.5-5 units of the polymerase. "Activated" DNA is a native preparation of DNA after partial
hydrolysis with DNase I until 5% of the DNA was transferred to the acid-soluble fraction. The reaction was conducted
at 70°C for 30 minutes, and stopped by adding 50 µl of a saturated aqueous solution of sodium pyrophosphate
containing 0.125 M EDTA-Na2. Samples were processed and activity was determined as described by Kaledin et al., supra.

The results showed that at pH 6.4 the polymerase was more than one-half as active as at pH 8.0. In contrast,
Kaledin et al. found that at pH about 7.0, the enzyme therein had 8% of the activity at pH 8.3. Therefore, the pH profile
for the thermostable enzyme herein is broader than that for the Kaledin et al. enzyme.

Finally, when only one or more nucleotide triphosphates were eliminated from a DNA polymerase assay reaction
mixture, very little, if any, activity was observed using the enzyme herein, and the activity was consistent with the
expected value, and with an enzyme exhibiting high fidelity. In contrast, the activity observed using the Kaledin et al.
(supra) enzyme is not consistent with the expected value, and suggests misincorporation of nucleotide triphosphate(s).


IV. Amplification Reaction 

One microgram of the genomic DNA described above was diluted in an initial 100 µl aqueous reaction volume
containing 25 mM Tris-HCl buffer (pH 8.0), 50 mM KCl, 10 mM MgCl2, 5 mM dithiothreitol, 200 µg/ml gelatin, 1 µM of
primer PC03, 1 µM of primer PC04, 1.5 mM dATP, 1.5 mM dCTP, 1.5 mM dGTP and 1.5 mM TTP. The sample was
heated for 10 minutes at 98°C to denature the genomic DNA, then cooled to room temperature. Four microliters of the
polymerase from Thermus aquaticus was added to the reaction mixture and overlaid with a 100 µl mineral oil cap. The
sample was then placed in the aluminum heating block of the liquid handling and heating instrument described above,
employing pipettes programmed to deliver liquids and a temperature-control device to effect temperature changes.

The DNA sample underwent 20 cycles of amplification in the machine, repeating the following program cycle:

1) heating from 37°C to 98°C in heating block over a period of 2.5 minutes, and

2) cooling from 98°C to 37°C over a period of three minutes to allow the primers and DNA to anneal.

After the last cycle, the sample was incubated for an additional 10 minutes at 55°C to complete the final extension
reaction.

V. Synthesis and Phosphorylation of Oligodeoxyribonucleotide Probes 

A labeled DNA probe, designated RS24, of the following sequence was prepared:

5'-*CCCACAGGGCAGTAACGGCAGACTTCTCCTCAGGAGTCAG-3' (RS24)

where * indicates the label. This probe is 40 bases long, spans the fourth through seventeenth codons of the gene,
and is complementary to the normal β-globin allele (βA). The schematic diagram of primers and probes is given below:

|<------------- 110 bp ------------->|
               β-globin
PC03            RS24             PC04

This probe was synthesized according to the procedures described in Section I of Example I. The probe was
labeled by contacting 20 pmole thereof with 4 units of T4 polynucleotide kinase and about 40 pmole γ-32P-ATP (about
7000 Ci/mmole) in a 40 µl reaction volume containing 70 mM Tris buffer (pH 7.6), 10 mM MgCl2, 1.5 mM spermine
and 10 mM dithiothreitol for 60 minutes at 37°C. The total volume was then adjusted to 100 µl with 25 mM EDTA and
purified according to the procedure of Maniatis et al., supra, p. 466-467 over a 1 ml spin dialysis column equilibrated
with Tris-EDTA (TE) buffer (10 mM Tris buffer, 0.1 mM EDTA, pH 8.0). TCA precipitation of the reaction product indicated
that for RS24 the specific activity was 4.3 µCi/pmole and the final concentration was 0.118 pmole/µl.


VI. Dot Blot Hybridizations 

Four microliters of the amplified sample from Section IV and 5.6 µl of appropriate dilutions of β-globin plasmid DNA
calculated to represent amplification efficiencies of 70, 75, 80, 85, 90, 95 and 100% were diluted with 200 µl 0.4 N
NaOH, 25 mM EDTA and spotted onto a nylon filter by first wetting the filter with water, placing it in an apparatus for
preparing dot blots which holds the filters in place, applying the samples, and rinsing each well with 0.1 ml of 20 x
SSPE (3.6 M NaCl, 200 mM NaH2PO4, 20 mM EDTA), as disclosed by Reed and Mann, Nucleic Acids Research, 13,
7202-7221 (1985). The filters were then removed, rinsed in 20 x SSPE, and baked for 30 minutes at 80°C in a vacuum
oven.
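
The hybridization and wash steps use SSPE at several strengths (20 x, 3 x and 2 x). As an illustration, the Python sketch below derives the component concentrations at any working strength from the 20 x composition given above, together with the volume of 20 x stock needed for a given final volume. The function names are illustrative assumptions, and simple proportional dilution is assumed.

```python
# Composition of 20 x SSPE as stated in the text, in mM.
SSPE_20X_MM = {"NaCl": 3600.0, "NaH2PO4": 200.0, "EDTA": 20.0}


def sspe_composition(strength):
    """Component concentrations (mM) of SSPE at the given 'x' strength."""
    return {name: conc * strength / 20.0 for name, conc in SSPE_20X_MM.items()}


def stock_volume_ml(strength, final_ml):
    """Volume of 20 x stock (ml) to dilute to final_ml at the given strength."""
    return final_ml * strength / 20.0


print(sspe_composition(1))      # 1 x working strength
print(sspe_composition(2))      # wash strength
print(stock_volume_ml(2, 100))  # stock needed for one 100 ml wash
```

At 1 x strength the sketch returns 180 mM NaCl, 10 mM phosphate and 1 mM EDTA, consistent with the 20 x composition divided by twenty.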

After baking, each filter was then contacted with 16 ml of a hybridization solution consisting of 3 x SSPE, 5 x
Denhardt's solution (1 x = 0.02% polyvinylpyrrolidone, 0.02% Ficoll, 0.02% bovine serum albumin, 0.2 mM Tris, 0.2
mM EDTA, pH 8.0), 0.5% SDS and 30% formamide, and incubated for two hours at 42°C. Then 2 pmole of probe RS24
was added to the hybridization solution and the filter was incubated for two minutes at 42°C.

Finally, each hybridized filter was washed twice with 100 ml of 2 x SSPE and 0.1% SDS for 10 minutes at room
temperature. Then the filters were treated once with 100 ml of 2 x SSPE, 0.1% SDS at 60°C for 10 minutes.
Each filter was then autoradiographed, with the signal readily apparent after two hours.

VII. Discussion of Autoradiogram 

The autoradiogram of the dot blots was analyzed after two hours and compared in intensity to standard serial
dilution β-globin reconstructions prepared with HaeIII/MaeI-digested pBR:βA, where βA is the wild-type allele, as
described in Saiki et al., Science, supra. Analysis of the reaction product indicated that the overall amplification efficiency
was about 95%, corresponding to a 630,000-fold increase in the β-globin target sequence.
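
The relationship between per-cycle efficiency and overall amplification can be checked with a short calculation. The Python sketch below assumes the simple model in which each cycle multiplies the target by (1 + efficiency), applied over the 20 cycles of Section IV; this model and the function name are assumptions made for illustration, not statements from the text.

```python
def fold_amplification(efficiency, cycles):
    """Overall fold increase if each cycle multiplies the target by (1 + efficiency)."""
    return (1.0 + efficiency) ** cycles


# Efficiencies represented by the plasmid dilution standards, over 20 cycles.
for percent in (70, 75, 80, 85, 90, 95, 100):
    fold = fold_amplification(percent / 100.0, 20)
    print("%3d%% efficiency -> about %.3g-fold" % (percent, fold))
```

Under this model, 95% efficiency over 20 cycles gives a fold increase of roughly 6.3 x 10^5, in line with the figure reported above.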

EXAMPLE II 

I. Amplification Reaction

Two 1 µg samples of genomic DNA extracted from the Molt 4 cell line as described in Example I were each diluted
in a 100 µl reaction volume containing 50 mM KCl, 25 mM Tris-HCl buffer pH 8.0, 10 mM MgCl2, 1 µM of primer PC03,
1 µM of primer PC04, 200 µg/ml gelatin, 10% dimethylsulfoxide (by volume), and 1.5 mM each of dATP, dCTP, dGTP
and TTP. After this mixture was heated for 10 minutes at 96°C to denature the genomic DNA, the samples were cooled
to room temperature and 4 µl of the polymerase from Thermus aquaticus described in Example I was added to each
sample. The samples were overlaid with mineral oil to prevent condensation and evaporative loss.
One of the samples was placed in the heating block of the machine described in Example I and subjected to 25
cycles of amplification, repeating the following program cycle:

(1) heating from 37 to 93°C over a period of 2.5 minutes;

(2) cooling from 93°C to 37°C over a period of three minutes to allow the primers and DNA to anneal; and

(3) maintaining at 37°C for two minutes.

After the last cycle the sample was incubated for an additional 10 minutes at 60°C to complete the final extension
reaction.

The second sample was placed in the heat-conducting container of a temperature cycling (heating and cooling)
machine. In this machine the heat-conducting container is attached to Peltier heat pumps which adjust the temperature
upwards or downwards and a microprocessor controller to control automatically the amplification sequence, the
temperature levels, the temperature ramping and the timing of the temperature.

The second sample was subjected to 25 cycles of amplification, repeating the following program cycle:

(1) heating from 37 to 95°C over a period of three minutes;

(2) maintaining at 95°C for 0.5 minutes to allow denaturation to occur;

(3) cooling from 95 to 37°C over a period of one minute; and

(4) maintaining at 37°C for one minute.
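
For comparison, the total run times of the two cycling programs can be tallied as follows. This is only an illustrative sketch: the step durations are transcribed from the two programs above (reading the first ramp of the first program as 2.5 minutes), the 10-minute final extension is applied only to the first sample because none is stated for the second, and any time spent on sample handling is ignored. The names used are hypothetical.

```python
# Each step is (description, minutes), transcribed from the two program cycles.
BLOCK_PROGRAM = [
    ("ramp 37 -> 93 C", 2.5),
    ("ramp 93 -> 37 C", 3.0),
    ("hold 37 C", 2.0),
]
PELTIER_PROGRAM = [
    ("ramp 37 -> 95 C", 3.0),
    ("hold 95 C", 0.5),
    ("ramp 95 -> 37 C", 1.0),
    ("hold 37 C", 1.0),
]


def run_minutes(program, cycles, final_extension=0.0):
    """Total programmed time: cycles x (sum of step times) + optional final extension."""
    per_cycle = sum(minutes for _, minutes in program)
    return cycles * per_cycle + final_extension


# First sample: 25 cycles plus the 10-minute incubation at 60 C after the last cycle.
block_total = run_minutes(BLOCK_PROGRAM, 25, final_extension=10.0)
# Second sample: 25 cycles; no final extension is stated, so none is assumed.
peltier_total = run_minutes(PELTIER_PROGRAM, 25)

print(f"heating-block machine: {block_total} min")
print(f"Peltier machine:       {peltier_total} min")
```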

II. Analysis 

Two tests were done for analysis, a dot blot and an agarose gel analysis. 
For the dot blot analysis, a labeled DNA probe, designated RS18, of the following sequence was prepared:

5'-*CTCCTGAGGAGAAGTCTGC-3' (RS18)

where * indicates the label. This probe is 19 bases long, spans the fourth through seventeenth codons of the gene,
and is complementary to the normal β-globin allele. The schematic diagram of primers and probes is given below:

[Schematic: 110 bp β-globin region, labeled with PC03, RS18 and PC04]

This probe was synthesized according to the procedures described in Section I of Example I. The probe was
labeled by contacting 10 pmole thereof with 4 units of T4 polynucleotide kinase and about 40 pmole γ-32P-ATP (about
7000 Ci/mmole) in a 40 µl reaction volume containing 70 mM Tris-HCl buffer (pH 7.6), 10 mM MgCl2, 1.5 mM spermine
and 10 mM dithiothreitol for 60 minutes at 37°C. The total volume was then adjusted to 100 µl with 25 mM EDTA and
purified according to the procedure of Maniatis et al., supra, p. 466-467 over a 1 ml spin dialysis column equilibrated
with Tris-EDTA (TE) buffer (10 mM Tris-HCl buffer, 0.1 mM EDTA, pH 8.0). TCA precipitation of the reaction product
indicated that for RS18 the specific activity was 4.6 µCi/pmole and the final concentration was 0.114 pmole/µl.

Five microliters of the amplified sample from Section I and of a sample amplified as described above except using
the Klenow fragment of E. coli DNA Polymerase I instead of the thermostable enzyme were diluted with 195 µl 0.4 N
NaOH, 25 mM EDTA and spotted onto two replicate nylon filters by first wetting the filters with water, placing them in
an apparatus for preparing dot blots which holds the filters in place, applying the samples, and rinsing each well with
0.4 ml of 20 x SSPE (3.6 M NaCl, 200 mM NaH2PO4, 20 mM EDTA), as disclosed by Reed and Mann, supra. The
filters were then removed, rinsed in 20 x SSPE, and baked for 30 minutes at 80°C in a vacuum oven.

After baking, each filter was then contacted with 6 ml of a hybridization solution consisting of 5 x SSPE, 5 x
Denhardt's solution (1 x = 0.02% polyvinylpyrrolidone, 0.02% Ficoll, 0.02% bovine serum albumin, 0.2 mM Tris, 0.2
mM EDTA, pH 8.0) and 0.5% SDS, and incubated for 60 minutes at 55°C. Then 5 µl of probe RS18 was added to the
hybridization solution and the filter was incubated for 60 minutes at 55°C.

Finally, each hybridized filter was washed twice with 100 ml of 2 x SSPE and 0.1% SDS for 10 minutes at room
temperature. Then the filters were treated twice more with 100 ml of 5 x SSPE, 0.1% SDS at 60°C for 1) one minute
and 2) three minutes, respectively.

Each filter was then autoradiographed, with the signal readily apparent after 90 minutes.

In the agarose gel analysis, 5 µl of each amplification reaction was loaded onto a 4% NuSieve/0.5% agarose gel in 1
x TBE buffer (0.089 M Tris, 0.089 M boric acid, and 2 mM EDTA) and electrophoresed for 60 minutes at 100 V. After
staining with ethidium bromide, DNA was visualized by UV fluorescence.

The results show that the machines used in Example I and this example were equally effective in amplifying the
DNA, showing discrete high-intensity 110-base pair bands of similar intensity, corresponding to the desired sequence,
as well as a few other discrete bands of much lower intensity. In contrast, the amplification method that involves reagent
transfer after each cycle using the Klenow fragment of E. coli Polymerase I gave a DNA smear resulting from the
nonspecific amplification of many unrelated DNA sequences.

It is expected that similar improvements in amplification and detection would be achieved in evaluating HLA-DQ,
DR and DP regions.

If in the above experiments the amplification reaction buffer contains 2 mM MgCl2 instead of 10 mM MgCl2 and
150-200 µM of each nucleotide rather than 1.5 mM of each, and if the lower temperature of 37°C is raised to 45-58°C
during amplification, better specificity and efficiency of amplification occurs. Also, DMSO was found not necessary or
preferred for amplification.

EXAMPLE III 

Amplification and Cloning

For amplification of a 119-base pair fragment on the human β-globin gene, a total of 1 microgram each of human
genomic DNA isolated from the Molt 4 cell line or from the GM2064 cell line (representing a homozygous deletion of
the β- and δ-hemoglobin region and available from the Human Genetic Mutant Cell Depository, Camden, NJ) as
described above was amplified in a 100 µl reaction volume containing 50 mM KCl, 25 mM Tris-HCl pH 8, 10 mM MgCl2,
200 µg/ml gelatin, 5 mM 2-mercaptoethanol, 1.5 mM each of dATP, dCTP, TTP, and dGTP, and 1 µM of each of the
following primers:

5'-CTTCTGcagCAACTGTGTTCACTAGC-3' (GH18)

5'-CACaAgCTTCATCCACGTTCACC-3' (GH19)

where lower case letters denote mismatches from wild-type sequence to create restriction enzyme sites. GH18 is a
26-base oligonucleotide complementary to the negative strand and contains an internal PstI site. GH19 is a 23-base
oligonucleotide complementary to the plus strand and contains an internal HindIII recognition sequence. These primers
were selected by first screening the regions of the gene for homology to the PstI and HindIII restriction sites. The
primers were then prepared as described in Example I.
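
The primer design can be checked with a few lines of code. The sketch below is illustrative only: it takes the two primer sequences as printed, counts their bases and lower-case (mismatched) positions, and locates the standard PstI (CTGCAG) and HindIII (AAGCTT) recognition sequences. The helper name and dictionary layout are hypothetical.

```python
# Primer sequences as printed; lower-case letters mark the engineered mismatches.
PRIMERS = {
    "GH18": "CTTCTGcagCAACTGTGTTCACTAGC",
    "GH19": "CACaAgCTTCATCCACGTTCACC",
}
# Standard recognition sequences for the two restriction enzymes named in the text.
SITES = {"PstI": "CTGCAG", "HindIII": "AAGCTT"}


def describe(primer: str) -> dict:
    """Return length, mismatch count, and 0-based start of each site found."""
    upper = primer.upper()
    return {
        "length": len(primer),
        "mismatches": sum(1 for base in primer if base.islower()),
        "sites": {name: upper.find(site) for name, site in SITES.items() if site in upper},
    }


for name, sequence in PRIMERS.items():
    print(name, describe(sequence))
```

Counted this way, GH18 has 26 bases with three mismatches forming a PstI site, and GH19 has 23 bases with two mismatches forming a HindIII site.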

The above reaction mixtures were heated for 10 minutes at 95°C and then cooled to room temperature. A total of
4 µl of the polymerase described in Example I was added to each reaction mixture, and then each mixture was overlaid
with mineral oil. The reaction mixtures were subjected to 30 cycles of amplification with the following program:

2.5 min. ramp, 37 to 98°C
3 min. ramp, 98 to 37°C
2 min. soak, 37°C

After the last cycle, the reaction mixtures were incubated for 20 minutes at 65°C to complete the final extension.
The mineral oil was extracted with chloroform and the mixtures were stored at -20°C.

A total of 10 µl of the amplified product was digested with 0.5 µg M13mp10 cloning vector, which is publicly available,
in a 50 µl volume containing 50 mM NaCl, 10 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 20 units PstI and 26 units HindIII
for 90 minutes at 37°C. The reaction was stopped by freezing at -20°C. The volume was adjusted to 110 µl with TE
buffer and loaded (100 µl) onto a 1 ml BioGel P-4 spin dialysis column. One 0.1 ml fraction was collected and ethanol
precipitated.

(At this point it was discovered that there was β-globin amplification product in the GM2064 sample. Subsequent
experiments traced the source of contamination to the primers, either GH18 or GH19. Because no other source of
primers was available, the experiment was continued with the understanding that some cloned sequences would be
derived from the contaminating DNA in the primers.)

The ethanol pellet was resuspended in 15 µl water, then adjusted to 20 µl volume containing 50 mM Tris-HCl pH
7.8, 10 mM MgCl2, 0.5 mM ATP, 10 mM dithiothreitol, and 400 units ligase. [One unit is the amount of enzyme required
to give 50% ligation of HindIII-digested λ DNA in 30 minutes at 16°C in 20 µl at a 5' termini concentration of 0.12 mM
(about 330 µg/ml).] This mixture was incubated for three hours at 16°C.

Ten microliters of ligation reaction mixture containing Molt 4 DNA was transformed into E. coli strain JM103
competent cells, which are publicly available. The procedure followed for preparing the transformed strain is described in
Messing, J. (1981) Third Cleveland Symposium on Macromolecules: Recombinant DNA, ed. A. Walton, Elsevier,
Amsterdam, 143-163. A total of 651 colorless plaques (and 0 blue plaques) were obtained. Of these, 119 had a (+)-strand
insert (18%) and 19 had a (-)-strand insert (3%). This is an increase of almost 20-fold over the percentage of β-globin
positive plaques among the primer-positive plaques from the amplification technique using Klenow fragment of E. coli
Polymerase I, where the reaction proceeded for two minutes at 25°C, after which the steps of heating to 100°C for two
minutes, cooling, adding Klenow fragment, and reacting were repeated nine times. These results confirm the improved
specificity of the amplification reaction employing the thermostable enzyme herein.

In a later cloning experiment with GM2064 and the contaminated primers, 43 out of 510 colorless plaques (8%)
had the (+)-strand insert. This suggests that approximately one-half of the 119 clones from Molt 4 contain the
contaminant sequence.
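
The plaque percentages and the "approximately one-half" estimate can be reproduced with simple arithmetic. This sketch rests on one assumption that the text does not state explicitly: that the contaminated primers give rise to (+)-strand contaminant clones at the same frequency in the Molt 4 experiment as in the GM2064 experiment. Variable names are hypothetical.

```python
def percent(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`."""
    return 100.0 * part / whole


# Plaque counts reported for the two cloning experiments.
MOLT4_TOTAL, MOLT4_PLUS, MOLT4_MINUS = 651, 119, 19
GM2064_TOTAL, GM2064_PLUS = 510, 43

print(f"Molt 4 (+)-strand inserts: {percent(MOLT4_PLUS, MOLT4_TOTAL):.1f}%")
print(f"Molt 4 (-)-strand inserts: {percent(MOLT4_MINUS, MOLT4_TOTAL):.1f}%")
print(f"GM2064 (+)-strand inserts: {percent(GM2064_PLUS, GM2064_TOTAL):.1f}%")

# Assumption: the contaminant appears at the GM2064 frequency among Molt 4 plaques.
expected_contaminant = GM2064_PLUS / GM2064_TOTAL * MOLT4_TOTAL
contaminant_share = expected_contaminant / MOLT4_PLUS
print(f"expected contaminant clones among Molt 4 plaques: {expected_contaminant:.0f}")
print(f"share of the 119 (+)-strand clones: {contaminant_share:.0%}")
```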

Ten of the (+)-strand clones from Molt 4 were sequenced. Five were normal wild-type sequence and five had a
single C to T mutation in the third position of the second codon of the gene (CAC to CAT). Four of the contaminant
clones from GM2064 were sequenced and all four were normal.

Restriction site-modified primers may also be used to amplify and clone and partially sequence the human N-ras
oncogene and to clone base pair segments of the HLA DQ-α, DQ-β and DR-β genes using the above technique.

Again, if the concentrations of MgCl2 and nucleotides are reduced to 2 mM and 150-200 µM, respectively, and the
minimum cycling temperature is increased from 37°C to 45-58°C, the specificity and efficiency of the amplification
reaction can be increased.

EXAMPLE IV 

Gene Retrieval 

A. IDENTIFICATION OF A DNA SEQUENCE PROBE FOR THE TAQ POLYMERASE GENE 

A specific DNA sequence probe for the Taq pol gene was obtained following immunological screening of a λgt11
expression library. T. aquaticus DNA was digested to completion with AluI, ligated with EcoRI 12-mer linkers
(CCGGAATTCCGG, New England Biolabs), digested with EcoRI and ligated with dephosphorylated, EcoRI-digested λgt11
DNA (Promega Biotech). The ligated DNA was packaged (Gigapack Plus, Stratagene) and transfected into E. coli K-12
strain Y1090 (provided by R. Young).

The initial library of 2 x 10^5 plaques was screened (Young, R.A., and R.W. Davis (1983) Science, 222:778-782)
with a 1:2000 dilution of a rabbit polyclonal antiserum raised to purified Taq polymerase (see Examples I and VI).
Candidate plaques were replated at limiting dilution and rescreened until homogeneous (~3 cycles). Phage were
purified from candidate plaques which failed to react with preimmune serum and reacted with immune serum.

Candidate phage were used to lysogenize E. coli K-12 strain Y1089 (R. Young). Lysogens were screened for the
production of an IPTG-inducible fusion protein (larger than β-galactosidase) which reacted with the Taq polymerase
antiserum. Solid phase, size-fractionated fusion proteins were used to affinity purify epitope-specific antibodies from
the total polyclonal antiserum (Goldstein, L.S.B., et al. (1986) J. Cell Biol. 102:2076-2087).

The "fished", epitope-selected antibodies were used, in turn, in a Western analysis to identify which λgt11 phage
candidates encoded DNA sequences uniquely specific to Taq polymerase. One λgt11 phage candidate, designated
λgt:1, specifically selected antibodies from the total rabbit polyclonal Taq polymerase antiserum which uniquely reacted
with both purified Taq polymerase and crude extract fractions containing Taq polymerase. This phage, λgt:1, was used
for further study.

The ~115 bp EcoRI-adapted AluI fragment of Thermus aquaticus DNA was labeled (Maniatis et al., supra) to
generate a Taq polymerase-specific probe. The probe was used in Southern analyses and to screen a T. aquaticus
DNA random genomic library.

B. CONSTRUCTION AND SCREENING OF A THERMUS AQUATICUS RANDOM GENOMIC LIBRARY

Lambda phage Charon 35 (Wilhelmine, A. M. et al., supra) was annealed and ligated via its cohesive ends, digested
to completion with BamHI, and the annealed arms were purified from the "stuffer" fragments by potassium acetate
density gradient ultracentrifugation (Maniatis et al., supra). T. aquaticus DNA was partially digested with Sau3A and
the 15-20 kb size fraction purified by sucrose density gradient ultracentrifugation. The random genomic library was
constructed by ligating the target and vector DNA fragments at a 1:1 molar ratio. The DNA was packaged and
transfected into E. coli K-12 strains LE392 or K802. A library of >20,000 initial phage containing >99% recombinants was
amplified on E. coli K-12 strain LE392.

The CH35:Taq genomic phage library was screened (Maniatis et al., supra) with the radiolabeled EcoRI insert of
λgt11:1. Specifically hybridizing candidate phage plaques were purified and further analyzed. One phage, designated
CH35:4-2, released ≥ four T. aquaticus DNA fragments upon digestion with HindIII (~8.0, 4.5, 0.6, 0.53 kb).

The four HindIII T. aquaticus DNA fragments were ligated with HindIII-digested plasmid BSM13+ (3.2 kb, Vector
Cloning Systems, San Diego) and individually cloned following transformation of E. coli K-12 strain DG98.

The ~8.0 kb HindIII DNA fragment from CH35:4-2 was isolated in plasmid pFC82 (11.2 kb), while the 4.5 kb HindIII
DNA fragment from CH35:4-2 was isolated in plasmid pFC83 (7.7 kb).

E. coli strain DG98 harboring pFC82 was shown to contain a thermostable, high temperature DNA polymerase
activity (Table 1). In addition, these cells synthesize a new ~60 kd molecular weight polypeptide which is
immunologically related to Taq DNA polymerase.

The Taq polymerase coding region of the 8.0 kb HindIII DNA fragment was further localized to the lac-promoter
proximal 2.8 kb HindIII to Asp718 portion of the 8.0 kb HindIII fragment. This region was subcloned to yield plasmid
pFC85 (6.0 kb). Upon induction with IPTG, E. coli DG98 cells harboring plasmid pFC85 synthesize up to 100-fold more
thermostable, Taq polymerase-related activity (Table 1) than the original parent clone (pFC82/DG98). While cells
harboring pFC85 synthesize a significant amount of a thermostable DNA polymerase activity, only a portion of the Taq pol
DNA sequence is translated, resulting in the accumulation of a ~60 kd Taq polymerase-related polypeptide.

TABLE 1

Expression of a Thermostable DNA Polymerase Activity in E. coli*

                      Units**/ml
Sample              -IPTG      +IPTG
BSM13/DG98                      0.02
pFC82/DG98            2.2        2.7
pFC85/DG98           11.9      643.3

* Cells were grown to late log phase (+/- IPTG, 10 mM), harvested, sonicated, heated at 75°C for 20 minutes,
centrifuged and the clarified supernatant assayed at 70°C for DNA polymerase activity.

** 1 unit = 1 nM dCTP incorporated in 30 minutes.
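
The effect of IPTG on each clone can be expressed as an induction ratio (+IPTG activity divided by -IPTG activity). The sketch below is a simple illustration using the Table 1 values for the two clones for which both figures are legible; the BSM13/DG98 control is left out because only one value is given. The names are hypothetical.

```python
# (units/ml without IPTG, units/ml with IPTG), transcribed from Table 1.
TABLE_1 = {
    "pFC82/DG98": (2.2, 2.7),
    "pFC85/DG98": (11.9, 643.3),
}


def induction_ratio(minus_iptg: float, plus_iptg: float) -> float:
    """Fold change in activity on IPTG induction."""
    if minus_iptg <= 0:
        raise ValueError("uninduced activity must be positive")
    return plus_iptg / minus_iptg


for sample, (minus_iptg, plus_iptg) in TABLE_1.items():
    print(f"{sample}: {induction_ratio(minus_iptg, plus_iptg):.1f}-fold on induction")
```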



EXAMPLE V 



Expression of Taq Polymerase 

The gene encoding the thermostable polymerase of the present invention can be expressed in any of a variety of
bacterial expression vectors including DG141 (ATCC 39588) and pPLNRBSATG. Both of these host vectors are pBR322
derivatives that have either a sequence containing a tryptophan promoter-operator and ribosome binding site with an
operably linked ATG start codon (DG141) or a sequence containing the lambda PL promoter and gene N ribosome
binding site operably linked to an ATG start codon (pPLNRBSATG). Either one of these host vectors may be restricted
with SacI, and blunt ended with Klenow or S1 nuclease to construct a convenient restriction site for subsequent insertion
of the Taq polymerase gene.

The full-length Taq polymerase gene was constructed from the DNA insert fragments subcloned into plasmids
pFC83 and pFC85 as follows. Vector BSM13+ (commercially available from Vector Cloning Systems, San Diego, CA)
was digested at the unique HindIII site, repaired with Klenow and dNTPs, and ligated with T4 DNA ligase to a BglII
octanucleotide linker, 5'-CAGATCTG-3', and transformed into E. coli strain DG98. Plasmids were isolated from AmpR
lacZα+ transformants. One of the clones was digested with BglII and Asp718 restriction enzymes, and the large vector
fragment purified by gel electrophoresis.

Next, plasmid pFC83 was digested with BglII and HindIII and the ~750 base pair fragment was isolated. Plasmid
pFC85 was digested with HindIII and Asp718 and the ~2.8 kb fragment isolated and joined in a three-piece ligation
to the ~750 base pair BglII-HindIII fragment from pFC83 and the BglII-Asp718 vector fragment of BSM13+. This ligation
mixture was used to transform E. coli strain DG98 (ATCC 39,768 deposited July 13, 1984) from which AmpR colonies
were selected and an ~6.75 kilobase plasmid (pLSG1) was isolated. Isopropyl-β-D-thiogalactoside (IPTG)-induced
DG98 cells harboring pLSG1 synthesized Taq DNA polymerase indistinguishable in size from the native enzyme
isolated from T. aquaticus. Plasmid pLSG1 can then be used to generate a single strand DNA template according to the
procedure recommended by Vector Cloning Systems.

Oligonucleotide-directed mutagenesis (see Zoller and Smith, Nuc. Acids Res. (1982) 10:6487-6500) can then be
used to introduce an SphI restriction site as part of the ATG start codon (upstream of the internal HindIII site in the
coding sequence of the Taq polymerase gene). Similarly, a BglII site can be introduced after the carboxyl-terminus of
the gene (~0.7 kb upstream from the Asp718 site) to facilitate subcloning of the Taq polymerase gene into an expression
vector. After the site-directed mutagenesis is performed, the gene can be isolated from the BSM13+ vector on an ~3.2
kb SphI-BstEII restriction fragment, treated with Klenow fragment and all four dNTPs, and inserted with T4 DNA ligase
(blunt-end conditions) into either one of the aforementioned expression vectors, which have been digested with SacI,
repaired with Klenow and dNTPs, and treated with calf intestine phosphatase to generate dephosphorylated blunt
ends. This ligation mixture is used to transform E. coli DG116 and the resulting transformants are screened for
production of Taq polymerase. Expression of the enzyme can be confirmed by Western immunoblot analysis and activity
analysis.

A greater proportion of the Taq polymerase gene contained within the ~2.8 kb HindIII-Asp718 fragment of plasmid
pFC85 can be expressed using, for example, plasmid pPLNRBSATG, by operably linking the amino-terminal HindIII
restriction site encoding the Taq pol gene to an ATG initiation codon. The product of this fusion upon expression will
yield an ~66,000-68,000 dalton truncated polymerase.

This specific construction can be made by digesting plasmid pFC85 with HindIII and treating with Klenow fragment
in the presence of dATP, dGTP and dCTP. The resulting fragment is treated further with S1 nuclease to remove any
single-stranded extensions, and the resulting DNA digested with Asp718 and treated with Klenow fragment in the
presence of all four dNTPs. The recovered fragment can be ligated using T4 DNA ligase to dephosphorylated plasmid
pPLNRBSATG, which had been digested with SacI and treated with Klenow fragment in the presence of dGTP to
construct an ATG blunt end. This ligation mixture can then be used to transform E. coli DG116 and the transformants
screened for production of Taq polymerase. Again, expression can be confirmed by Western immunoblot analysis and
activity analysis.

EXAMPLE VI 

Purification 

The thermostable polymerase may be purified directly from a culture of Thermus aquaticus following the example
disclosed below or, alternatively, from a bacterial culture containing the recombinantly produced enzyme with only
minor modifications necessary in the preparation of the crude extract.

After harvesting by centrifugation, 60 grams of cells were resuspended in 75 ml of a buffer consisting of 50 mM
Tris-Cl pH 8, 1 mM EDTA. Cells were lysed in a French Press at 14,000-16,000 PSI after which 4 volumes (300 ml) of
additional Tris-EDTA were added. Buffer A (β-mercaptoethanol to 5 mM and NP-40 and Tween 20 to 0.5% (v/v) each)
was added and the solution was sonicated thoroughly while cooling. The resultant homogeneous suspension was
diluted further with Buffer A such that the final volume was 7.5-8 times the starting cell weight; this was designated
Fraction I.

The polymerase activity in Fraction I and subsequent fractions was determined in a 50 µl mixture containing 0.025
M TAPS-HCl pH 9.4 (20°C), 0.002 M MgCl2, 0.05 M KCl, 1 mM 2-mercaptoethanol, 0.2 mM each dGTP, dATP, TTP,
0.1 mM dCTP [α-32P, 0.05 Ci/mM], 12.5 µg "activated" salmon sperm DNA and 0.01-0.2 units of the polymerase (diluted
in 10 mM Tris-HCl, pH 8, 50 mM KCl, 1 mg/ml autoclaved gelatin, 0.5% NP-40, 0.5% Tween 20, and 1 mM
2-mercaptoethanol). One unit corresponds to 10 nM product in 30 minutes. "Activated" DNA is a native preparation of DNA after
partial hydrolysis with DNase I until 5% of the DNA was transferred to the acid-soluble fraction. The reaction was
conducted at 74°C for 10 minutes and then 40 µl was transferred to 1.0 ml of 50 µg/ml carrier DNA in 2 mM EDTA at
0°C. An equal volume (1.0 ml) of 20% TCA, 2% sodium pyrophosphate was added. After 15-20 minutes at 0°C the
samples were filtered through Whatman GF/C discs and extensively washed with cold 5% TCA-1% pyrophosphate,
followed by cold 95% ethanol, dried and counted.

Fraction I was centrifuged for two hours at 35,000 rpm in a Beckman TI 45 rotor at 2°C and the collected supernatant
was designated Fraction II.

The Taq polymerase activity was precipitated with Polymin P (BRL, Gaithersburg, MD) (10%, w/v, adjusted to pH
7.5 and autoclaved) after the minimum amount of Polymin P necessary to precipitate 90-95% of the activity was
determined, which amount was generally found to be between 0.25% and 0.3% final volume.

An appropriate level of Polymin P was added slowly to Fraction II while stirring for 15 minutes at 0°C. This solution
was centrifuged at 13,000 rpm for 20 minutes in a Beckman JA 14 rotor at 2°C. The supernatant was assayed for
activity and the pellet was resuspended in 1/5 volume of 0.5X Buffer A (diluted 1:2 with H2O). This suspension was
recentrifuged and the pellet resuspended in 1/4 volume of Buffer A containing 0.4 M KCl. This suspension was
homogenized thoroughly and left overnight at 4°C. The homogenate was centrifuged as above and the collected
supernatant designated Fraction III.

The protein fraction was collected by "precipitation" at 75% saturation of ammonium sulfate, centrifuged (at 27,000
rpm, SW27 rotor, 30 minutes) and the floating pellicle was resuspended in 50 mM Tris-Cl pH 8, 1 mM EDTA. These
steps were repeated and the protein suspension was dialyzed extensively with P-cell buffer (20 mM KPO4 pH 7.5, 0.5
mM EDTA, 5 mM β-mercaptoethanol, 5% (w/v) glycerol, 0.5% (v/v) NP-40 and Tween 20) containing 80 mM KCl.

The dialysate was transferred to a centrifuge bottle to which was added any recovered protein from sacks rinsed
with the P-cell buffer containing 80 mM KCl. Centrifugation was performed at 20,000 x g and the time was reduced to
15 minutes. The supernatant was saved and any pellet remaining was washed, extracted with P-cell buffer and 80 mM
KCl and recentrifuged. The supernatants were then combined to form Fraction IV.

Fraction IV was applied to a 2.2 x 22-cm column of phosphocellulose, equilibrated with the P-cell buffer containing
80 mM KCl. The column was washed (2.5-3 column volumes) with the same buffer and the protein eluted using a linear
gradient of 60 to 400 mM KCl in P-cell buffer. Fractions containing DNA polymerase activity (~0.16-0.20 M KCl) were
pooled and concentrated 3-4 fold on an Amicon stirred cell and YM30 membrane. The cell was rinsed with the P-cell
buffer without KCl and added to the fraction concentrate (0.15 M KCl adjusted final volume) to form Fraction V.

Fraction V was applied to a 5 ml Heparin Sepharose CL-6B column (Pharmacia) equilibrated with P-cell buffer
and 0.15 M KCl. The column was washed with 0.15 M KCl buffer (3-4 column volumes) and the protein eluted with a
linear gradient from 0.15 to 0.65 M KCl in P-cell buffer. A 1:10 dilution into diluent without gelatin was made for
SDS-PAGE analysis and a subsequent 1:20 dilution into diluent with 1 mg/ml gelatin was made for use in enzyme assays.
The activity fractions (eluting at ~0.3 M KCl) were assayed on supercoiled DNA template for specific and non-specific
endonucleases/topoisomerase by electrophoretically detecting the change in molecular weight of supercoiled plasmid
DNA after incubation with an excess of DNA polymerase. Exonuclease contamination was detected following incubation
with small linear DNA fragments. In peak fractions, an ~88 kd protein was found to be the major band. The major pool,
designated Fraction VI, had the highest polymerase activity with minimal detectable endonuclease activity when this
pool was assayed for 30 minutes at 55°C with ~3-5 polymerase units/600 ng DNA.

Fraction VI was dialyzed against 10 mM KPO4 pH 7.5, 5 mM β-mercaptoethanol, 5% glycerol, 0.2% NP-40, and
0.2% Tween 20 (HA buffer). The dialyzed sample was applied to a 3 ml column of hydroxyapatite and the enzyme
eluted with a linear gradient of 10 to 250 mM KPO4 pH 7.5, HA buffer. DNA polymerase activity began to elute at 75
mM KPO4 with the peak at 100 mM KPO4. Active peak fractions were assayed at 1:100-1:300 dilution. As in the prior
chromatography step, a 1:10 dilution in diluent was prepared without gelatin for SDS-PAGE analysis. Fractions with no
significant endonuclease or double-strand exonuclease when assayed at 55°C with 5 polymerase units were pooled
and designated Fraction VII.

Fraction VII was dialyzed against a solution of 25 mM sodium acetate pH 5.2, 5% glycerol, 5 mM
β-mercaptoethanol, 0.1 mM EDTA, 0.1% NP-40 and 0.1% Tween 20, adjusted to pH 5 at room temperature. The dialyzed sample
was applied to a 2 ml DEAE-Tris-Acryl-M (LKB) column pre-equilibrated and subsequently washed with the same
buffer. The fraction containing polymerase activity that did not adhere to the column was pooled and adjusted to 50
mM NaCl in the same buffer to yield Fraction VIII.

Fraction VIII was applied to a 2 ml CM-Tris-Acryl M (LKB) column equilibrated with the same buffer (25 mM sodium
acetate, 50 mM NaCl, 5% glycerol, 0.1 mM EDTA, 0.1% NP-40, and 0.1% Tween 20). The column was washed with
4-5 column volumes of the same buffer and the enzyme eluted with a linear gradient from 50 to 400 mM NaCl in sodium
acetate buffer. The polymerase activity peak eluted at ~0.15-0.20 M NaCl. The polymerase activity was assayed at
1:300 to 1:500 dilution with the first dilution 1:10 into diluent without gelatin for the SDS-PAGE analysis. An assay across
the activity peak on supercoiled DNA templates for specific and non-specific endonuclease/topoisomerase using DNA
polymerase assay salts (25 mM TAPS-HCl pH 9.4, 2.0 mM MgCl2 and 50 mM KCl) at 74°C was performed, as well as
assays for nucleases on M13 ss DNA and pBR322 fragments. Active fractions with no detectable nuclease(s) were
pooled and run on a silver stained SDS-PAGE mini gel. The results show a single ~68 kd band with a specific activity
of ~250,000 units/mg.

This specific activity is more than an order of magnitude higher than that claimed for the previously isolated Taq
polymerase and is at least an order of magnitude higher than that for E. coli polymerase I.

EXAMPLE VII 

The Taq polymerase purified as described above in Example VI was found to be free of any contaminating Taq
endonuclease and exonuclease activities. In addition, the Taq polymerase is preferably stored in storage buffer
containing from about 0.1 to about 0.5% volume/volume of each non-ionic polymeric detergent employed. More preferably
the storage buffer consists of 50% (v/v) glycerol, 100 mM KCl, 20 mM Tris-Cl pH 8.0, 0.1 mM ethylenediaminetetraacetic
acid (EDTA), 1 mM dithiothreitol, 0.5% v/v NP-40, 0.5% v/v Tween 20, and 200 µg/ml gelatin, and is preferably stored
at -20°C.

The stored Taq polymerase was diluted in a buffer consisting of 25 mM Tris-Cl pH 8.0, 20 mM KCl, 1 mM
β-mercaptoethanol, 0.5% NP-40, 0.5% Tween-20, and 500 µg/ml gelatin. A reaction buffer was then prepared containing
50 mM KCl, 10 mM Tris-Cl, pH 8.3, 1.5 mM MgCl2, 0.01% (w/v) gelatin, 200 µM each dNTP, 1 µM each of the primers
that define a 500 base pair target sequence on a control template from bacteriophage λ, and 2.0-2.5 units Taq
polymerase/assay in a final volume of 100 µl. Template was added to the reaction buffer, the sample placed in a 0.5 ml
polypropylene tube, and the sample topped with 100 µl of heavy white mineral oil to prevent evaporation.

At least a 10^5-fold amplification was achieved when the following conditions were employed, using 1 ng of control
template (bacteriophage λ DNA) where the target sequence represented approximately 1% of the starting mass of DNA.

First the template mixture was denatured for one minute, 30 seconds at 94°C by placing the tube in a heat bath.
Then the tube was placed in a heat bath at 37°C for two minutes. Then the tube was placed in a heat bath at 72°C for
three minutes, and then in the heat bath at 94°C for one minute. This cycle was repeated for a total of 25 cycles. At
the end of the 25th cycle, the heat denaturation step at 94°C was omitted and replaced by extending the 72°C incubation
step by an additional three minutes. Following termination of the assay, the samples were allowed to cool to room
temperature and analyzed as described in previous examples.
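
To put the 10^5-fold figure in perspective, the sketch below converts it into an average per-cycle efficiency and an approximate mass of target product. It is an illustration under stated assumptions, not a description of how the result was measured: it treats every one of the 25 cycles as equally efficient, takes the "at least 10^5-fold" value as a lower bound, and uses the 1 ng template and roughly 1% target fraction quoted above. Function and variable names are hypothetical.

```python
def per_cycle_efficiency(fold: float, cycles: int) -> float:
    """Average efficiency e such that (1 + e) ** cycles == fold."""
    if fold < 1.0 or cycles <= 0:
        raise ValueError("fold must be >= 1 and cycles must be positive")
    return fold ** (1.0 / cycles) - 1.0


TEMPLATE_NG = 1.0        # control template added, from the text
TARGET_FRACTION = 0.01   # target is roughly 1% of the starting DNA mass
MIN_FOLD = 1e5           # "at least a 10^5-fold amplification" (lower bound)
CYCLES = 25

start_target_ng = TEMPLATE_NG * TARGET_FRACTION
min_product_ng = start_target_ng * MIN_FOLD
efficiency = per_cycle_efficiency(MIN_FOLD, CYCLES)

print(f"starting target: {start_target_ng * 1000:.0f} pg")
print(f"minimum product: {min_product_ng / 1000:.1f} ug")
print(f"average per-cycle efficiency: at least {efficiency:.0%}")
```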

The template may be optimally amplified with a different concentration of dNTPs and a different amount of Taq
polymerase. Also, the size of the target sequence in the DNA sample will directly impact the minimum time required
for proper extension (72°C incubation step). An optimization of the temperature cycling profile should be performed
for each individual template to be amplified, to obtain maximum efficiency.

EXAMPLE VIII 

Taq polymerase purified as described above in Example I was formulated for storage as described in the previous
example, but without the non-ionic polymeric detergents. When assayed for activity as described in that example, the
enzyme storage mixture was found to be inactive. When the NP-40 and Tween 20 were added to the storage buffer,
the full enzyme activity was restored, indicating that the presence of the non-ionic detergents is necessary to the
stability of the enzyme formulation.

EXAMPLE IX

Several 1 µg samples of human genomic DNA were subjected to 20-35 cycles of amplification as described in
Example V, with equivalent units of either Klenow fragment or Taq polymerase, and analyzed by agarose gel
electrophoresis and Southern blot. The primers used in these reactions, PC03 and PC04, direct the synthesis of a 110-bp
segment of the human beta-globin gene. The Klenow polymerase amplifications exhibited the smear of DNA typically
observed with this enzyme, the apparent cause of which is the non-specific annealing and extension of primers to
unrelated genomic sequences under what were essentially non-stringent hybridization conditions (1x Klenow salts at
37°C). Nevertheless, by Southern blot a specific 110-bp beta-globin target fragment was detected in all lanes. A
substantially different electrophoretic pattern was seen in the amplifications done with Taq polymerase where the single
major band is the 110-bp target sequence. This remarkable specificity was undoubtedly due to the temperature at
which the primers were extended.

Although, like Klenow fragment amplifications, the annealing step was performed at 37°C, the temperature of
Taq-catalyzed reactions had to be raised to about 70°C before the enzyme exhibited significant activity. During this transition
from 37 to 70°C, poorly matched primer-template hybrids (which formed at 37°C) disassociated so that by the time
the reaction reached an enzyme-activating temperature, only highly complementary substrate was available for
extension. This specificity also results in a greater yield of target sequence than similar amplifications done with Klenow
fragment, because the non-specific extension products effectively compete for the polymerase, thereby reducing the
amount of 110-mer that can be made by the Klenow fragment.

EXAMPLE X

Amplification was carried out of a sample containing 1 µg Molt 4 DNA, 50 mM KCl, 10 mM Tris pH 8.3, 10 mM
MgCl2, 0.01% gelatin, 1 µM of each of the following primers (to amplify a 150 bp region):

5'-CATGCCTCTTTGCACCATTC-3' (RS79) and
5'-TGGTAGCTGGATTGTAGCTG-3' (RS80)

1.5 mM of each dNTP, and 5.0 units of Taq polymerase per 100 µl reaction volume. Three additional samples were
prepared containing 2.5, 1.3, or 0.6 units of Taq polymerase. The amplification was carried out in the temperature
cycling machine described above using the following cycle, for 30 cycles:

from 70 to 98°C for 1 minute
hold at 98°C for 1 minute
from 98°C to 35, 45 or 55°C for 1 minute
hold at 35, 45 or 55°C for 1 minute
from 35, 45 or 55°C to 70°C for 1 minute
hold at 70°C for 30 seconds
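
The ramp-and-hold cycle listed above can be expressed as a small table of segments parameterized by the annealing temperature. The sketch below is illustrative only; the function names `cycle_segments`, `run_minutes` and `cooling_rate` are hypothetical, and it assumes each ramp is linear over its stated duration, which the text does not specify.

```python
ALLOWED_ANNEAL_C = (35, 45, 55)


def cycle_segments(anneal_c: int) -> list[tuple[float, float, float]]:
    """Return one cycle as (start_c, end_c, minutes) segments."""
    if anneal_c not in ALLOWED_ANNEAL_C:
        raise ValueError(f"annealing temperature must be one of {ALLOWED_ANNEAL_C}")
    return [
        (70, 98, 1.0),               # ramp up to denaturation
        (98, 98, 1.0),               # hold at 98 C
        (98, anneal_c, 1.0),         # ramp down to annealing temperature
        (anneal_c, anneal_c, 1.0),   # hold at annealing temperature
        (anneal_c, 70, 1.0),         # ramp up to extension temperature
        (70, 70, 0.5),               # hold at 70 C for 30 seconds
    ]


def run_minutes(anneal_c: int, cycles: int = 30) -> float:
    """Total programmed time for the given number of cycles."""
    per_cycle = sum(minutes for _, _, minutes in cycle_segments(anneal_c))
    return cycles * per_cycle


def cooling_rate(anneal_c: int) -> float:
    """Average cooling rate (degrees C per minute) of the 98 C -> anneal ramp,
    assuming a linear ramp."""
    start_c, end_c, minutes = cycle_segments(anneal_c)[2]
    return (start_c - end_c) / minutes


if __name__ == "__main__":
    for t in ALLOWED_ANNEAL_C:
        print(t, run_minutes(t), cooling_rate(t))
```

Because every ramp lasts one minute regardless of the target temperature, the programmed run time is the same for all three annealing temperatures; only the implied ramp rates differ.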
At 35°C annealing temperature, the 2.5 units/100 µl Taq enzyme dilution gave the best signal-to-noise ratio by
agarose gel electrophoresis over all other Taq polymerase concentrations. At 45°C, the 5 units/100 µl Taq enzyme
gave the best signal-to-noise ratio over the other concentrations. At 55°C, the 5 units/100 µl Taq enzyme gave the best
signal-to-noise ratio over the other concentrations and over the 45°C annealing, and improved yield. The Taq
polymerase has more specificity and better yield at 55°C.

In a separate experiment the Molt 4 DNA was 10-fold serially diluted into the cell line GM2064 DNA, containing
no β- or δ-globin sequences, available from the Human Genetic Mutant Cell Depository, Camden, New Jersey, at
various concentrations representing varying copies per cell, and amplification was carried out on these samples as
described in this example at annealing temperatures of 35°C and 55°C. At 35°C, the best that can be seen by agarose
gel electrophoresis is 1 copy in 50 cells. At 55°C, the best that can be seen is 1/5,000 cells (a 100-fold improvement
over the lower temperature), illustrating the importance of increased annealing temperature for Taq polymerase
specificity under these conditions.
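
The 100-fold figure follows directly from the two detection limits quoted above. A minimal sketch of the arithmetic, using exact fractions; the function name `fold_improvement` is hypothetical and introduced only for illustration:

```python
from fractions import Fraction


def fold_improvement(limit_before: Fraction, limit_after: Fraction) -> Fraction:
    """Ratio of two detection limits, each given as copies per cell.

    A smaller limit means a rarer target can still be seen, so the
    improvement is the old limit divided by the new one.
    """
    return limit_before / limit_after


# Detection limits reported for agarose gel electrophoresis.
limit_35c = Fraction(1, 50)     # 1 copy in 50 cells at 35 C annealing
limit_55c = Fraction(1, 5000)   # 1 copy in 5,000 cells at 55 C annealing

if __name__ == "__main__":
    print(fold_improvement(limit_35c, limit_55c))
```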

In a third experiment, DNA from a cell line 368H containing HIV-positive DNA, available from B. Poiesz, State
University of New York, Syracuse, NY, was similarly diluted into the DNA from the SC1 cell line (deposited with ATCC
on March 19, 1985, an EBV-transformed B cell line homozygous for the sickle cell allele and lacking any HIV sequences)
at various concentrations representing varying copies per cell, and amplification was carried out as described in this
Example at annealing temperatures of 35°C and 55°C, using the primers SK38 and SK39, which amplify a 115 bp
region of the HIV sequence:

5'-ATAATCCACCTATCCCAGTAGGAGAAAT-3' (SK38) and
5'-TTTGGTCCTTGTCTTATGTCCAGAATGC-3' (SK39)

The results by agarose gel electrophoresis showed that only the undiluted 368H sample could be detected with
the annealing temperature at 35°C, whereas at least a 10⁻² dilution can be detected with the annealing temperature
at 55°C, giving a 100-fold improvement in detection.

EXAMPLE XI 

cDNA was made from 1 µg of rabbit reticulocyte mRNA (Bethesda Research Laboratories) in a 100 µl reaction
volume containing 150 mM KCl, 50 mM Tris-HCl (pH 8.3), 10 mM MgCl2, 5 mM DTT, 0.5 mM dATP, 0.5 mM dCTP,
0.5 mM TTP, 0.5 mM dGTP, 0.2 µg oligo(dT)12-18 (Pharmacia), 40 units RNasin (Promega Biotec), and 5 units AMV
reverse transcriptase (BRL) and incubated for 30 minutes at 42°C. The reaction was stopped by heating for 10 minutes
at 95°C. Two µg RNase A was added to the sample (2 µl of a 2 mg/ml solution in water) and incubated for 10 minutes
at 37°C.

Three amplification reactions were done with the Klenow fragment using different pairs of primers. The primer pair
PC03/PC04 defines a 110-bp product. The primer pair RS45/oligo(dT)25-30 defines an about 370-bp product, and the
primer pair PC03/oligo(dT)25-30 an about 600-bp product. PC03, PC04, and RS45 are complementary to the human
β-globin gene and each has two mismatches with the rabbit gene. PC03 and PC04 are described in Example I. RS45
has the sequence: 5'-CAAGAAGGTGCTAGGTGCC-3'.

The amplification reactions were performed with 1/20th (5 µl) of the cDNA described above in a 100 µl reaction
volume containing 50 mM NaCl, 10 mM Tris-HCl (pH 7.6), 10 mM MgCl2, 200 µg/ml gelatin, 10% DMSO, 1 µM PC03
or RS45, 1 µM PC04 or oligo(dT)25-30, 1.5 mM dATP, 1.5 mM dCTP, 1.5 mM TTP and 1.5 mM dGTP. The samples
were heated for five minutes at 98°C, then cooled to room temperature and overlaid with 100 µl mineral oil.

The samples were subjected to 10 cycles of automated amplification using the machine described in Example I
and using the following program:

1) heating from 37°C to 98°C in a heating block over 2.5 minutes (denature);
2) cooling from 98°C to 37°C over 3.0 minutes (anneal);
3) adding 1 unit Klenow fragment; and
4) maintaining at 37°C for 20 minutes (extend).

The final volume of each sample was about 140 µl.
One-twentieth (7 µl) of each sample was analyzed by electrophoresis on a 2% agarose gel. After staining with
ethidium bromide, discrete bands were seen in the PC03/PC04 and RS45/oligo(dT) samples. The sizes of the bands
were consistent with the expected lengths: 110-bp for the former, about 370-bp for the latter. No evidence of
amplification of an about 600-bp fragment with the PC03/oligo(dT) primer pair was observed.
The contents of the gel were Southern blotted onto a Genatran nylon membrane and hybridized with a
nick-translated human β-globin probe, pBR328:betaA, described in Saiki et al., Science, supra, using standard techniques. The
resulting autoradiogram extended the conclusions reached previously: the 110 and about 370-bp fragments were
β-globin specific amplification products and no significant amplification of the about 600-bp band was detected.

Three additional samples were amplified with the Taq polymerase obtained as described above using the same
primer pairs described previously. Five microliter portions of cDNA were amplified in 100 µl reaction volumes containing
50 mM KCl, 25 mM Tris-HCl (pH 8.0), 10 mM MgCl2, 200 µg/ml gelatin, 10% DMSO, 1 µM PC03 or RS45, 1 µM PC04
or oligo(dT)25-30, 1.5 mM dATP, 1.5 mM dCTP, 1.5 mM TTP and 1.5 mM dGTP. The samples were heated for five
minutes at 98°C, then cooled to room temperature. One microliter of Taq polymerase (1/8 dilution of lot 2) was added
to each and overlaid with about 100 µl mineral oil.

The samples were subjected to 9 cycles of amplification in the Peltier device described in the previous example 
using the following program: 

1) 1 min., 35 to 60°C ramp;
2) 12 min., 60 to 70°C ramp (extend);
3) 1 min., 70 to 95°C ramp (denature);
4) 30 sec., 95°C soak;
5) 1 min., 95 to 35°C ramp (anneal); and
6) 30 sec., 35°C soak.

After the last cycle, the samples were incubated an additional 10 minutes at 70°C to complete the final (10th cycle)
extension. The final volume of each was about 100 µl.
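
The two programs used in this example can be compared side by side by tabulating their steps. This is a rough sketch under stated assumptions, not a description of the actual instrument software: the names `KLENOW_PROGRAM`, `TAQ_PROGRAM` and `program_minutes` are hypothetical, and the manual Klenow addition is counted as taking zero time because its duration is not given.

```python
# Each program is a list of (description, minutes) steps for one cycle.
KLENOW_PROGRAM = [
    ("denature: 37 to 98 C ramp", 2.5),
    ("anneal: 98 to 37 C ramp", 3.0),
    ("add 1 unit Klenow fragment", 0.0),  # duration not stated; assumed zero
    ("extend: hold at 37 C", 20.0),
]

TAQ_PROGRAM = [
    ("35 to 60 C ramp", 1.0),
    ("extend: 60 to 70 C ramp", 12.0),
    ("denature: 70 to 95 C ramp", 1.0),
    ("95 C soak", 0.5),
    ("anneal: 95 to 35 C ramp", 1.0),
    ("35 C soak", 0.5),
]


def program_minutes(program, cycles, final_extension=0.0):
    """Programmed time for `cycles` repeats plus an optional final incubation."""
    per_cycle = sum(minutes for _, minutes in program)
    return cycles * per_cycle + final_extension


# 10 Klenow cycles, with fresh enzyme added in every cycle.
klenow_total = program_minutes(KLENOW_PROGRAM, cycles=10)
# 9 Taq cycles followed by the additional 10 minutes at 70 C.
taq_total = program_minutes(TAQ_PROGRAM, cycles=9, final_extension=10.0)
# Enzyme additions: one per Klenow cycle, a single addition for Taq.
enzyme_additions = {"Klenow": 10, "Taq": 1}

if __name__ == "__main__":
    print(f"Klenow: {klenow_total} min, Taq: {taq_total} min")
```

The comparison makes the practical difference visible: the thermostable enzyme is added once and survives the denaturation step, so no per-cycle addition appears in its program.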

As before, 1/20th (10 µl) of each sample was analyzed on a 2% agarose gel. In this gel, amplification products
were present in all three samples: 110-bp for PC03/PC04, about 370-bp for RS45/oligo(dT), and about 600-bp for
PC03/oligo(dT). These results were confirmed by Southern transfer and hybridization with the pBR328:betaA probe.

The production of the 600-bp product with Taq polymerase but not with the Klenow fragment is significant, and 
suggests that Taq polymerase is capable of producing longer DNA than the Klenow fragment. 

The following bacteriophage and bacterial strains were deposited with the Cetus Master Culture Collection, 1400
Fifty-Third Street, Emeryville, California, USA (CMCC) and with the American Type Culture Collection, 12301 Parklawn
Drive, Rockville, Maryland, USA (ATCC). These deposits were made under the provisions of the Budapest Treaty on
the International Recognition of the Deposit of Microorganisms for purposes of Patent Procedure and the Regulations
thereunder (Budapest Treaty). This assures maintenance of a viable culture for 30 years from the date of deposit. The
organisms will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement
between applicants and ATCC that assures unrestricted availability upon issuance of the pertinent U.S. patent.
Availability of the deposited strains is not to be construed as a license to practice the invention in contravention of the rights
granted under the authority of any government in accordance with its patent laws.
In summary, the present invention is seen to provide a process for amplifying one or more specific nucleic acid
sequences using a temperature-cycled chain reaction and a thermostable enzyme, in which reaction primer extension
products are produced that can subsequently act as templates for further primer extension reactions. The process is
especially useful in detecting nucleic acid sequences that are initially present in only very small amounts and in detecting
nucleotide variations using sequence-specific oligonucleotides. Also, the amplification process herein can be used for
molecular cloning.

The process herein results in increased yields of amplified product, greater specificity, and fewer steps necessary
to carry out the amplification procedure, over what has been previously disclosed.

Claims 

1. A stable enzyme composition comprising a thermostable nucleic acid polymerase enzyme in a buffer comprising
one or more non-ionic polymeric detergents.
The composition of claim 1, wherein the detergents are each present in a concentration of about 0.1% to about
0.5% volume/volume of the total composition.

The composition of claim 1 or 2, wherein the detergents are a polyoxyethylated sorbitan monolaurate and an
ethoxylated alkyl phenol, preferably an ethoxylated nonyl phenol.

The composition of any one of claims 1 to 3, wherein the buffer comprises glycerol, Tris-Cl, pH 8.0, ethylenediamine
tetraacetic acid, dithiothreitol, a polyoxyethylated sorbitan monolaurate, an ethoxylated nonyl phenol, and gelatin.

A method of producing a stable enzyme composition comprising formulating a thermostable nucleic acid
polymerase enzyme in a buffer comprising one or more non-ionic polymeric detergents.